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Title : RICIN-LIKE TOXIN VARIANTS FOR TREATMENT OF 
CANCER, VIRAL OR PARASITIC INFECTIONS 

This is a continuation-in-part of U.S. patent application serial 
No. 09/403,752 filed October 29, 1999. 
5 FIELD OF THE INVENTION 

The invention relates to proteins useful as therapeutics 
against cancer, viral infections, parasitic and fungal infections. The proteins 
contain A and B chains of a ricin-like toxin linked by a linker sequence that 
is specifically cleaved and activated by proteases specific to disease- 

10 associated pathogens or cells. 

BACKGROUND OF THE INVENTION 

Bacteria and plants are known to produce cytotoxic proteins 
which may consist of one, two or several polypeptides or subunits. Those 
proteins having a single subunit may be loosely classified as Type I 

15 proteins. Many of the cytotoxins which have evolved two subunit 
structures are referred to as type II proteins (Saelinger, C.B. in Trafficking 
of Bacterial Toxins (eds. Saelinger, CB.) 1-13 (CRC Press Inc., Boca Raton, 
Florida, 1990). One subunit, the A chain, possesses the toxic activity 
whereas the second subunit, the B chain, binds cell surfaces and mediates 

20 entry of the toxin into a target cell. A subset of these toxins kill target cells 
by inhibiting protein biosynthesis. For example, bacterial toxins such as 
diphtheria toxin or Pseudomonas exotoxin inhibit protein synthesis by 
inactivating elongation factor 2. Plant toxins such as ricin, abrin, and 
bacterial toxin Shiga toxin, inhibit protein synthesis by directly inactivating 

25 the ribosomes (Olsnes, S. & Phil, A. in Molecular action of toxins and 
viruses (eds. Cohen, P. & vanHeyningen, S.) 51-105 Elsevier Biomedical 
Press, Amsterdam, 1982). 

Ricin, derived from the seeds of Ricinus communis (castor oil 
plant), may be the most potent of the plant toxins. It is estimated that a 

30 single ricin A chain is able to inactivate ribosomes at a rate of 1500 
ribosomes/minute. Consequently, a single molecule of ricin is enough to 
kill a cell (Olsnes, S. & Phil, A* in Molecular action of toxins and viruses 
(eds. Cohen, P. & vanHeyningen, S.) (Elsevier Biomedical Press, 
Amsterdam, 1982). The ricin toxin is a glycosylated heterodimer consisting 



of A and B chains with molecular masses of 30,625 Da and 31,431 Da linked 
by a disulphide bond. The A chain of ricin has an N-glycosidase activity 
and catalyzes the excision of a specific adenine residue from the 28S rRNA 
of eukaryotic ribosomes (Endo, Y. & Tsurugi, K. J., Biol Chem. 262:8128 
5 (1987)). The B chain of ricin, although not toxic in itself, promotes the 
toxicity of the A chain by binding to galactose residues on the surface of 
eukaryotic cells and stimulating receptor-mediated endocytosis of the toxin 
molecule (Simmons et al., Biol. Chem. 261:7912 (1986)). Once the toxin 
molecule consisting of the A and B chains is internalized into the cell via 
10 clathrin-dependent or independent mechanisms, the greater reduction 
Jt potential within the cell induces a release of the active A chain, eliciting its 

YJl inhibitory effect on protein synthesis and its cytotoxicity (Emmanuel, F. et 

U al., Anal. Biochem. 173: 134-141 (1988); Blum, J.S. et al, /. Biol Chem. 266: 

f J 22091-22095 (1991); Fiani, M.L. et al, Arch. Biochem. Biophys. 307: 225-230 

H 15 (1993)). Empirical evidence suggests that activated toxin {e.g. ricin, shiga 

^ toxin and others) in the endosomes is transcytosed through the trans-Golgi 

HR network to the endoplasmic reticulum by retrograde transport before the 

> A chain is translocated into the cytoplasm to elicit its action (Sandvig, K. & 

D van Deurs, B v FEBS Lett 346: 99-102 (1994). 

20 Protein toxins are initially produced in an inactive, precursor 

form. Ricin is initially produced as a single polypeptide (preproricin) with a 
35 amino acid N-terminal presequence and 12 amino acid linker between 
the A and B chains. The pre-sequence is removed during translocation of 
the ricin precursor into the endoplasmic reticulum (Lord, J.M., Eur. J. 
25 Biochem. 146:403-409 (1985) and Lord, J.M., Eur. ]. Biochem. 146:411-416 
(1985)). The proricin is then translocated into specialized organelles called 
protein bodies where a plant protease cleaves the protein at a linker region 
between the A and B chains (Lord, J.M. et al., FASAB Journal 8:201-208 
(1994)). The two chains, however, remain covalently attached by an 
30 interchain disulfide bond (cysteine 259 in the A chain to cysteine 4 in the B 
chain) and mature disulfide linked ricin is stored in protein bodies inside 
the plant cells. The A chain is inactive in proricin (O'Hare, M. et al., FEBS 
Lett 273:200-204 (1990)) and it is inactive in the disulfide-linked mature ricin 
(Richardson, P.T. et al., FEBS Lett 255:15-20 (1989)). The ribosomes of the 
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castor bean plant are themselves susceptible to inactivation by ricin A 
chain; however, as there is no cell surface galactose to permit B chain 
recognition the A chain cannot re-enter the cell. The exact mechanism of A 
chain release and activation in target cell cytoplasm is not known (Lord, 
5 J.M. et al., FASAB Journal 8:201-208 (1994)). However, it is known that for 
activation to take place the disulfide bond between the A and B chains 
must be reduced and, hence, the linkage between subunits broken. 

Diphtheria toxin is produced by Corynebacterium diphtheriae 
as a 535 amino acid polypeptide with a molecular weight of approximately 

10 58kD (Greenfield, L. et al, Proc. Natl Acad. ScL USA 80:6853-6857 (1983); 
Pastan, I. et aL, Annu, Rev. Biochem. 61:331-354 (1992); Collier, RJ. & 
Kandel, J., J, Biol Chern. 246:1496-1503 (1971)). It is secreted as a single- 
chain polypeptide consisting of 2 functional domains. Similar to proricin, 
the N-terminal domain (A-chain) contains the cytotoxic moiety whereas 

15 the C-terminal domain (B-chain) is responsible for binding to the cells and 
facilitates toxin endocytosis. Conversely, the mechanism of cytotoxicity 
for diphtheria toxin is based on ADP-ribosylation of EF-2 thereby blocking 
protein synthesis and producing cell death. The 2 functional domains in 
diphtheria toxin are linked by an arginine-rich peptide sequence as well as 

20 a disulphide bond. Once the diphtheria toxin is internalized into the cell, 
the arginine-rich peptide linker is cleaved by trypsin-like enzymes and the 
disulphide bond (Cys 186-201) is reduced. The cytotoxic domain is 
subsequently translocated into the cytosol substantially as described above 
for ricin and elicits ribosomal inhibition and cytotoxicity. 

25 Pseudomonas exotoxin is also a 66kD single-chain toxin 

protein secreted by Pseudomonas aeruginosa with a similar mechanism of 
cytotoxicity to that of diphtheria toxin (Pastan, I. et al., Annu. Rev. Biochem. 
61:331-354 (1992); Ogata, M. et al., J. Biol Chem. 267:25396-25401 (1992); 
Vagil, M.L. et al., Infect Immunol 16:353-361 (1977)). Pseudomonas exotoxin 

30 consists of 3 conjoint functional domains. The first domain la (amino acids 
1-252) is responsible for cell binding and toxin endocytosis, a second 
domain II (amino acids 253-364) is responsible for toxin translocation from 
the endocytic vesicle to the cytosol, and a third domain III (amino acids 
400-613) is responsible for protein synthesis inhibition and cytotoxicity. 
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After Pseudomonas exotoxin enters the cell, the liberation of the cytotoxic 
domain is effected by both proteolytic cleavage of a polypeptide sequence 
in the second domain (near Arg 279) and the reduction of the disulphide 
bond (Cys 265-287) in the endocytic vesicles. In essence, the overall 
5 pathway to cytotoxicity is analogous to diphtheria toxin with the exception 
that the toxin translocation domain in Pseudomonas exotoxin is structurally 
distinct. 

Class 2 ribosomal inhibitory proteins (RIP-2) constitute 
other toxins possessing distinct functional domains for cytotoxicity and cell 

10 binding/toxin translocation which include abrin, modeccin, volkensin, 
(Sandvig, K. et aL, Biochem. Soc. Trans, 21:707-711 (1993)) and mistle toe 
lectin (viscumin) (Olsnes, S. & Phil, A. in Molecular action of toxins and 
viruses (eds. Cohen, P. & vanHeyningen, S.) 51-105 Elsevier Biomedical 
Press, Amsterdam, 1982; and Fodstad, et aL Cane. Res. 44:862 (1984)). 

15 Some toxins such as Shiga toxin and cholera toxin also have multiple 
polypeptide chains responsible for receptor binding and endocytosis. 

The ricin gene has been cloned and sequenced, and the 
X-ray crystal structures of the A and B chains have been described 
(Rutenber, E. et aL Proteins 10:240-250 (1991); Weston et aL, Mol Bio. 

20 244:410-422, 1994; Lamb and Lord, Eur. J. Biochem. 14:265 (1985); Hailing, K. 
et aL Nucleic Acids Res. 13:8019 (1985)). Similarly, the genes for diptheria 
toxin and Pseudomonas exotoxin have been cloned and sequenced, and the 
3-dimensional structures of the toxin proteins have been elucidated and 
described (Columblatti, M. et aL, /. Biol Chem. 261:3030-3035 (1986); 

25 Allured, V.S. et aL, Proc. Natl Acad. Sci. USA 83:1320-1324 (1986); Gray, G.L. 
et aL, Proc. Natl Acad. Sci. USA 81:2645-2649 (1984); Greenfield, L. et aL, 
Proc. Natl Acad. Sci USA 80:6853-6857 (1983); Collier, RJ, et aL, J. Biol 
Chem. 257:5283-5285 (1982)). 

The potential of bacterial and plant toxins for inhibiting 

30 mammalian retroviruses, particularly acquired immunodeficiency 
syndrome (AIDS), has been investigated. Bacterial toxins such as 
Pseudomonas exotoxin-A and subunit A of diphtheria toxin; dual chain 
ribosomal inhibitory plant toxins such as ricin, and single chain ribosomal 
inhibitory proteins such as trichosanthin and pokeweed antiviral protein 
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have been used for the elimination of HIV infected cells (Olson et al v AIDS 
Res. and Human Retroviruses 7:1025-1030 (1991)), The high toxicity of these 
toxins for mammalian cells, combined with a lack of specificity of action 
poses a major problem to the development of pharmaceuticals 
5 incorporating the toxins, such as immunotoxins. 

Due to their extreme toxicity there has been much interest 
in making ricin-based immunotoxins as therapeutic agents for specifically 
destroying or inhibiting infected or tumourous cells or tissues (Vitetta et 
al., Science 238:1098-1104(1987)). An immunotoxin is a conjugate of a 

10 specific cell binding component, such as a monoclonal antibody or growth 
factor and the toxin in which the two protein components are covalently 
linked. Generally, the components are chemically coupled. However, the 
linkage may also be a peptide or disulfide bond. The antibody directs the 
toxin to cell types presenting a specific antigen thereby providing a 

15 specificity of action not possible with the natural toxin. Immunotoxins have 
been made both with the entire ricin molecule (i.e. both chains) and with 
the ricin A chain alone (Spooner et al., Mol. Immunol 31:117-125, (1994)). 

Immunotoxins made with the ricin dimer (IT-Rs) are more 
potent toxins than those made with only the A chain (IT-As). The 

20 increased toxicity of IT-Rs is thought to be attributed to the dual role of the 
B chains in binding to the cell surface and in translocating the A chain to the 
cytosolic compartment of the target cell (Vitetta et al., Science 238:1098- 
1104 (1987); Vitetta & Thorpe, Seminars in Cell Biology 2:47-58 (1991)). 
However, the presence of the B chain in these conjugates also promotes 

25 the entry of the immunotoxin into nontarget cells. Even small amounts of 
B chain may override the specificity of the cell-binding component as the B 
chain will bind nonspecifically to galactose associated with N-linked 
carbohydrates, which is present on most cells. IT-As are more specific and 
safer to use than IT-Rs. However, in the absence of the B chain the A chain 

30 has greatly reduced toxicity. Due to the reduced potency of IT-As as 
compared to IT-Rs, large doses of IT-As must be administered to patients. 
The large doses frequently cause immune responses and production of 
neutralizing antibodies in patients (Vitetta et al., Science 238:1098-1104 
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(1987)). IT- As and IT-Rs both suffer from reduced toxicity as the A chain is 
not released from the conjugate into the target cell cytoplasm. 

A number of immunotoxins have been designed to 
recognize antigens on the surfaces of tumour cells and cells of the immune 
5 system (Pastan et aL, Annals New York Academy of Sciences 758:345-353 
(1995)). A major problem with the use of such immunotoxins is that the 
antibody component is its only targeting mechanism and the target 
antigen is often found on non-target cells (Vitetta et al., Immunology Today 
14:252-259 (1993)). Also, the preparation of a suitable specific cell binding 

10 component may be problematic. For example, antigens specific for the 
target cell may not be available and many potential target cells and 
infective organisms can alter their antigenic make up rapidly to avoid 
immune recognition. In view of the extreme toxicity of proteins such as 
ricin, the lack of specificity of the immunotoxins may severely limit their 

15 usefulness as therapeutics for the treatment of cancer and infectious 
diseases. 

The insertion of intramolecular protease cleavage sites 
between the cytotoxic and cell-binding components of a toxin can mimic 
the way that the natural toxin is activated. European patent application no. 

20 466,222 describes the use of maize-derived pro-proteins which can be 
converted into active form by cleavage with extracellular blood enzymes 
such as factor Xa, thrombin or collagenase. Garred, O. et aL (/. Biol Chem. 
270:10817-10821 (1995)) documented the use of a ubiquitous calcium- 
dependent serine protease, furin, to activate shiga toxin by cleavage of the 

25 trypsin-sensitive linkage between the cytotoxic A-chain and the pentamer 
of cell-binding B-units. Westby et al. (Bioconjugate Chem. 3:375-381 (1992)) 
documented fusion proteins which have a specific cell binding component 
and proricin with a protease sensitive cleavage site specific for factor Xa 
within the linker sequence. O'Hare et al. {FEES Lett 273:200-204 (1990)) 

30 also described a recombinant fusion protein of RTA and staphylococcal 
protein A joined by a trypsin-sensitive cleavage site. In view of the 
ubiquitous nature of the extracellular proteases utilized in these 
approaches, such artificial activation of the toxin precursor or 
immunotoxin does not confer a mechanism for intracellular toxin 
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activation and the problems of target specificity and adverse 
immunological reactions to the cell-binding component of the 
immunotoxin remain. 

In a variation of the approach of insertion of intramolecular 
5 protease cleavage sites on proteins which combine a binding chain and a 
toxic chain, Leppla, S.H. et al (Bacterial Protein Toxins zbl.bakt.suppl. 
24:431-442 (1994)) suggest the replacement of the native cleavage site of 
the protective antigen (PA) produced by Bacillus anthracis with a cleavage 
site that is recognized by cells that contain a particular protease. PA, 

10 recognizes, binds, and thereby assists in the internalization of lethal factor 
(LF) and edema toxin (ET). also produced by Bacillus anthracis. However, 
this approach is wholly dependent on the availability of LF, or ET and PA 
all being localized to cells wherein the modified PA can be activated by the 
specific protease. It does not confer a mechanism for intracellular toxin 

15 activation and presents a problem of ensuring sufficient quantities of toxin 
for internalization in target cells. 

The in vitro activation of a Staphylococcus-derived pore- 
forming toxin, a-hemolysin by extracellular tumour-associated proteases 
has been documented (Panchel, R.G. et aL, Nature Biotechnology 14:852-857 

20 (1996)). Artificial activation of a-hemolysin in vitro by said proteases was 
reported but the actual activity and utility of a-hemolysin in the 
destruction of target cells were not demonstrated. 

Hemolysin does not inhibit protein synthesis but is a 
heptameric transmembrane pore which acts as a channel to allow leakage 

25 of molecules up to 3 kD thereby disrupting the ionic balances of the living 
cell. The a-hemolysin activation domain is likely located on the outside of 
the target cell (for activation by extracellular proteases). The triggering 
mechanism in the disclosed hemolysin precursor does not involve the 
intracellular proteolytic cleavage of 2 functionally distinct domains. Also, 

30 the proteases used for the a-hemolysin activation are ubitquitiously 
secreted extracellular proteases and toxin activation would not be confined 
to activation in the vicinity of diseased cells. Such widespread activation of 
the toxin does not confer target specificity and limits the usefulness of said 
a-hemolysin toxin as therapeutics due to systemic toxicity. 
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A variety of proteases specifically associated with 
malignancy, viral infections and parasitic infections have been identified 
and described. For example, cathepsin is a family of serine, cysteine or 
aspartic endopeptidases and exopeptidases which has been implicated to 
5 play a primary role in cancer metastasis (Schwartz, M.K., Clin. Chim. Acta 
237:67-78 (1995); Spiess, E. et al., J. Histochem. Cytochem. 42:917-929 (1994); 
Scarborough, RE. et al., Protein Sci 2:264-276 (1993); Sloane, BR. et al, Proc. 
Natl Acad. Sci. USA 83:2483-2487 (1986); Mikkelsen, T. et al., /. Neurosurge 
83:285-290 (1995)). Matrix metalloproteinases (MMPs or matrixins) are 

10 zinc-dependent proteinases consisting of collagenases, matrilysin, 
stromelysins, gelatinases and macrophage elastase (Krane, S.M., Ann, N.Y. 
Acad. Set 732:1-10 (1994); Woessner, J.R, Ann. N.Y. Acad. Sci. 732:11-21 
(1994); Carvalho, K. et al, Biochem. Biophys. Res. Comm. 191:172-179 (1993); 
Nakano, A. et al /. of Neurosurge, 83:298-307 (1995); Peng, K-W, et al 

15 Human Gene Therapy, 8:729-738 (1997); More, D.H. et aL Gynaecologic 
Oncology, 65:78-82 (1997)). These proteases are involved in pathological 
matrix remodeling. Under normal physiological conditions, regulation of 
matrixin activity is effected at the level of gene expression. Enzymatic 
activity is also controlled stringently by tissue inhibitors of 

20 metalloproteinases (TIMPs) (Murphy, G. et al., Ann. N.Y. Acad. Sci 732:31- 
41 (1994)). The expression of MMP genes is reported to be activated in 
inflammatory disorders (e.g. rheumatoid arthritis) and malignancy. 

In malaria, parasitic serine and aspartic proteases are 
involved in host erythrocyte invasion by the Plasmodium parasite and in 

25 hemoglobin catabolism by intraerythrocytic malaria (O'Dea, K.P. et al., 
Mol. Biochem. Parasitol. 72:111-119 (1995); Blackman, MJ. et al., Mol. Biochem. 
Parasitol. 62:103-114 (1993); Cooper, J.A. et al, Mol Biochem. Parasitol 56:151- 
160 (1992); Goldberg, D.E. et aL, /. Exp. Med. 173:961-969 (1991)). 
Schistosoma mansoni is also a pathogenic parasite which causes 

30 schistosomiasis or bilharzia. Elastinolytic proteinases have been associated 
specifically with the virulence of this particular parasite (McKerrow, J.H. et 
al.,/. Biol Chem. 260:3703-3707 (1985)). 

Welch, A.R. et al. (Proc. Natl Acad. Sci. USA 88:10797-10800 
(1991)) has described a series of viral proteases which are specifically 
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associated with human cytomegalovirus, human herpesviruses, Epstein- 
Bar r virus, varicella zoster virus-L and infectious laryngotracheitis virus. 
These proteases possess similar substrate specificity and play an integral 
role in viral scaffold protein restructuring in capsid assembly and virus 
5 maturation. Other viral proteases serving similar functions have also been 
documented for human T-cell leukemia virus (Blaha, L et aL, FEBS Lett. 
309:389-393 (1992); Pettit, S.C. et aL, /. Biol Chem. 266:14539-14547 (1991)), 
hepatitis viruses (Hirowatari, Y. et aL, Anal Biochem. 225:113-120 (1995); 
Hirowatari, Y. et aL, Arch. Virol 133:349-356 (1993); Jewell, D.A. et aL, 
10 Biochemistry 31:7862-7869 (1992)), poliomyelitis virus (Weidner, J.R. et aL, 
Arch. Biochem. Biophys. 286:402-408 (1991)), and human rhinovirus (Long, 
A.C. et aL, FEBS Lett 258:75-78 (1989)). 

Candida yeasts are dimorphic fungi which are responsible 
for a majority of opportunistic infections in AIDS patients (Holmberg, K. 
15 and Myer, R., Scand. J. Infect Dis. 18:179-192 (1986)). Aspartic proteinases 
have been associated specifically with numerous virulent strains of Candida 
including Candida albican, Candida tropicalis, and Candida parapsilosis (Abad- 
Zapatero, C. et aL, Protein Set 5:640-652 (1996); Cutfield, S.M. et aL, 
Biochemistry 35:398-410 (1995); Ruchel, R. et al, Zentralbl Bakteriol Mikrobiol 
20 Hyg. I Abt. Orig. A. 255:537-548 (1983); Remold, H. et aL, Biochim. Biophys. 
Acta 167:399-406 (1968)), and the levels of these enzymes have been 
correlated with the lethality of the strain (Schreiber, B, et aL, Diagn. 
Microbiol Infect Dis. 3:1-5 (1985)). 
SUMMARY OF THE INVENTION 
25 The invention relates to novel recombinant toxic proteins 

which are specifically toxic to diseased cells but do not depend for their 
specificity of action on a specific cell binding component. The recombinant 
proteins of the invention have an A chain of a ricin-like toxin linked to a B 
chain by a synthetic linker sequence which may be cleaved specifically by a 
30 protease localised in cells or tissues affected by a specific disease to liberate 
the toxic A chain thereby selectively inhibiting or destroying the diseased 
cells or tissues. The term diseased cells as used herein, includes cells 
affected by cancer, or infected by fungi, or viruses, including retroviruses, 
or parasites. 
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Toxin targeting using the recombinant toxic proteins of the 
invention takes advantage of the fact that many DNA viruses exploit host 
cellular transport mechanisms to escape immunological destruction. This is 
achieved by enhancing the retrograde translocation of host major 

5 histocompatibility complex (MHC) type I molecules from the endoplasmic 
reticulum into the cytoplasm (Bonifacino, J.S., Nature 384: 405-406 (1996); 
Wiertz, E.J. et al, Nature 384: 432-438 (1996)). The facilitation of retrograde 
transport in diseased cells by the virus can enhance the transcytosis and 
cytotoxicity of a recombinant toxic protein of the present invention 

10 thereby further reducing non-specific cytotoxicity and improving the 
overall safety of the product. 

The recombinant toxic proteins of the present invention 
may be used to treat diseases including various forms of cancer such as T- 
and B-cell lymphoproliferative diseases, ovarian cancer, pancreatic cancer, 

15 head and neck cancer, squamous cell carcinoma, gastrointestinal cancer, 
breast cancer, prostate cancer, non small cell lung cancer, malaria, and 
diverse viral disease states associated with infection with human 
cytomegalovirus, hepatitis virus, herpes virus, human rhinovirus, 
infectious laryngotracheitis virus, poliomyelitis virus, or varicella zoster 

20 virus. 

In one aspect, the present invention provides a purified and 
isolated nucleic acid having a nucleotide sequence encoding an A chain of a 
ricin-like toxin, a B chain of a ricin-like toxin and a heterologous linker 
amino acid sequence, linking the A and B chains. The linker sequence is not 

25 a native linker sequence of a ricin-like toxin, but rather a synthetic 
heterologous linker sequence containing a cleavage recognition site for a 
disease-specific protease. The A and or the B chain may be those of ricin. 

In an embodiment, of the invention the cleavage 
recognition site is the cleavage recognition site for a cancer-associated 

30 protease. In particular embodiments, the linker amino acid sequence 
comprises SLLKSRMVPNFN or SLLIARRMPNFN cleaved by cathepsin B; 
SKLVQASASGVN or SSYLKASDAPDN cleaved by an Epstein-Barr virus 
protease; RPKPQQFFGLMN cleaved by MMP-3 (stromelysin); 
SLRPLALWRSFN cleaved by MMP-7 (matrilysin); SPQGIAGQRNFN 
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cleaved by MMP-9; DVDERDVRGFASFL cleaved by a thermolysin-like 
MMP; SLPLGLWAPNFN cleaved by matrix metalloproteinase 2(MMP-2) ; 
SLLIFRSWANFN cleaved by cathespin L; SGVVIATVIVIT cleaved by 
cathespin D; SLGPQGIWGQFN cleaved by matrix metalloproteinase 
5 l(MMP-l); KKSPGRVVGGSV cleaved by urokinase-type plasminogen 
activator; PQGLLGAPGILG cleaved by membrane type 1 
matrixmetalloproteinase (MT-MMP); 

HGPEGLRVGFYESDVMGRGHARLVHVEEPHT cleaved by stromelysin 3 
(or MMP-11), thermolysin, fibroblast collagenase and stromelysin-1; 
10 GPQGLAGQRGIV cleaved by matrix metalloproteinase 13 (collagenase-3); 
GGSGQRGRKALE cleaved by tissue-type plasminogen activator(tPA); 
SLSALLSSDIFN cleaved by human prostate-specific antigen; 
SLPRFKIIGGFN cleaved by kallikrein (hK3); SLLGIAVPGNFN cleaved by 
neutrophil elastase; and FFKNIVTPRTPP cleaved by calpain (calcium 
15 activated neutral protease). The nucleic acid sequences for ricin A and B 
chains with each of the linker sequences are shown in Figures 2D, 35C, 3D, 
4D, 5D, 6D, 16D, 17D, 34C, 36C , 37C, 38C , 39C, 40C, 41C, 42C , 43C, 44C, 
45C, 46C and 47C, respectively. 

In another embodiment, the cleavage recognition site is the 
20 cleavage recognition site for a protease associated with the malaria 
parasite, Plasmodium falciparum. In particular embodiments, the linker 
amino acid sequence comprises QWQLQNYDEED; LPIFGESEDNDE; 
QVVTGEAISVTM; ALERTFLSFPTN or KFQDMLNISQHQ. The nucleic 
nucleotide sequences for ricin A and B chains with each of the linker 
25 sequences are shown in Figures 7D, 8D, 9D, 10D, and 11D. 

In a another embodiment, the cleavage recognition site is 
the cleavage recognition site for a viral protease. The linker sequences 
preferably comprise the sequence Y-X-Y-A-Z wherein X is valine or 
leucine, Y is a polar amino acid, and Z is serine, asparagine or valine. In 
30 particular embodiments, the linker amino acid sequence comprises 
SGVVNASCRLAN or SSYVKASVSPEN cleaved by a human 
cytomegalovirus protease; SALVNASSAHVN or STYLQASEKFKN cleaved 
by a herpes simplex 1 virus protease; SSILNASVPNFN cleaved by a 
human herpes virus 6 protease; SQDVNAVEASSN or SVYLQASTGYGN 



-12- 



cleaved by a varicella zoster virus protease; or SKYLQANEVITN cleaved 
by an infectious laryngotracheitis virus protease. The nucleic nucleotide 
sequences for ricin A and B chains with each of the linker sequences are 
shown in Figures 12D, 13D, 14D, 15D, 18D, 19D, 20D, and 22D. 
5 In another embodiment, the cleavage recognition site is the 

cleavage recognition site for a hepatitis A viral protease. In particular 
embodiments, the linker amino acid sequence comprises SELRTQSFSNWN 
or SELWSQGIDDDN cleaved by a hepatitis A virus protease. The nucleic 
nucleotide sequences for ricin A and B chains with each of the linker 
10 sequences are shown in Figures 23D or 24D. 

In another embodiment, the cleavage recognition site is the 
cleavage recognition site for a hepatitis C viral protease. In particular 
embodiments, the linker amino acid sequence comprises 
DLEVVTSTWVFN, DEMEECASHLFN, EDVVCCSMSYFN or 
15 KGWRLLAPITAY cleaved by a hepatitis C virus protease. The nucleic 
nucleotide sequences for ricin A and B chains with each of the linker 
sequences are shown in Figures 30C, 31C, 32C and 33C 

In another embodiment, the cleavage recognition site is the 
cleavage recognition site for a Candida fungal protease. In particular 
20 embodiments, the linker amino acid sequence is SKPAKFFRLNFN, 
SKPIEFFRLNFN or SKPAEFFALNFN cleaved by Candida aspartic 
protease. The nucleic nucleotide sequences for ricin A and B chains with 
the first linker sequence are shown in Figures 25D. 

The present invention also provides a plasmid incorporating 
25 the nucleic acid of the invention. In an embodiment, the plasmid has the 
restriction map as shown in Figures 2A, 3A, 4A, 5A, 6A, 7A, 8A, 9A, 10A, 
11A, 12A, 13A, 14A, 15A, 16A, 17A, 18A, 19 A, 20A, 21A, 22A, 23A, 24A, or 
25A. 

In another embodiment, the present invention provides a 
30 baculovirus transfer vector incorporating the nucleic acid of the invention. 
In particular embodiments, the invention provides a baculovirus transfer 
vector having the DNA sequence as shown in Figure 1. 

In a further embodiment, the present invention provides a 
baculovirus transfer vector incorporating the nucleic acid of the invention. 
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In particular embodiments, the invention provides a baculovirus transfer 
vector having the restriction map as shown in Figures 2C, 3C, 4C, 5C, 6C, 
7C, 8C, 9C, IOC, 11C, 12C, 13C, 14C, 15C, 16C, 17C, 18C, 19C, 20C, 21C, 
22C, 23C, 24C, 25C, 30A, 31A, 32A, 33A, 34A, 35A, 36A, 37A, 38A, 39 A, 
5 40A, 41 A, 42A, 43 A, 44A, 45A, 46A, or 47A. or having the DNA sequence as 
shown in Figure 1. 

In a further aspect, the present invention provides a 
recombinant protein comprising an A chain of a ricin-like toxin, a B chain 
of a ricin-like toxin and a heterologous linker amino acid sequence, linking 

10 the A and B chains, wherein the linker sequence contains a cleavage 
recognition site for a disease-specific protease (e.g.. a cancer, viral, parasitic, 
or fungal protease). The A and/ or the B chain may be those of ricin. In an 
embodiment, the cleavage recognition site is the cleavage recognition site 
for a cancer, viral or parasitic protease substantially as described above. In 

15 a particular embodiment, the cancer is T-cell or B-cell lymphoproliferative 
disease. In another particular embodiment, the virus is human 
cytomegalovirus, Epstein-Barr virus, hepatitis virus, herpes virus, human 
rhinovirus, infectious laryngotracheitis virus, poliomyelitis virus, or 
varicella zoster virus. In a further particular embodiment, the parasite is 

20 Plasmodium falciparum. 

In a further aspect, the invention provides a pharmaceutical 
composition for treating a fungal infection, such as Candida, in a mammal 
comprising the recombinant protein of the invention and a 
pharmaceutically acceptable carrier, diluent or excipient. 

25 In yet another aspect, the invention provides a method of 

inhibiting or destroying cells affected by a disease, which cells are 
associated with a disease specific protease, including cancer or infection 
with a virus, fungus, or a parasite each of which has a specific protease, 
comprising the steps of preparing a recombinant protein of the invention 

30 having a heterologous linker sequence which contains a cleavage 
recognition site for the disease-specific protease and administering the 
recombinant protein to the cells. In an embodiment, the cancer is T-cell or 
B-cell lymphoproliferative disease, ovarian cancer, pancreatic cancer, head 
and neck cancer, squamous cell carcinoma, gastrointestinal cancer, breast 
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cancer, prostate cancer, non small cell lung cancer. In another 
embodiment, the virus is human cytomegalovirus, Epstein-Barr virus, 
hepatitis virus, herpes virus, human rhinovirus, human T-cell leukemia 
virus, infectious laryngotracheitis virus, poliomyelitis virus, or varicella 
5 zoster virus. In another embodiment, the parasite is Plasmodium falciparum. 

The present invention also relates to a method of treating a 
mammal with disease wherein cells affected by the disease are associated 
with a disease specific protease, including cancer or infection with a virus, 
fungus, or a parasite each of which has a specific protease by administering 
10 an effective amount of one or more recombinant proteins of the invention 
to said mammal. 

® Still further, a process is provided for preparing a 

If! pharmaceutical for treating a mammal with disease wherein cells affected 

by the disease are associated with a disease specific protease, including 
Lf| 15 cancer or infection with a virus, fungus, or a parasite each of which has a 

specific protease comprising the steps of preparing a purified and isolated 
□ nucleic acid having a nucleotide sequence encoding an A chain of a 

JJC ricin-like toxin, a B chain of a ricin-like toxin and a heterologous linker 

Jp amino acid sequence, linking the A and B chains, wherein the linker 

Si 20 sequence contains a cleavage recognition site for the disease-specific 

protease; introducing the nucleic acid into a host cell; expressing the nucleic 
acid in the host cell to obtain a recombinant protein comprising an A chain 
of a ricin-like toxin, a B chain of a ricin-like toxin and a heterologous linker 
amino acid sequence, linking the A and B chains wherein the linker 
25 sequence contains the cleavage recognition site for the disease-specific 
protease; and suspending the protein in a pharmaceutical^ acceptable 
carrier, diluent or excipient. 

In an embodiment, a process is provided for preparing a 
pharmaceutical for treating a mammal with disease wherein cells affected 
30 by the disease are associated with a disease specific protease, including 
cancer or infection with a virus, fungus, or a parasite each of which has a 
specific protease comprising the steps of identifying a cleavage recognition 
site for the protease; preparing a recombinant protein comprising an A 
chain of a ricin-like toxin, a B chain of a ricin-like toxin and a heterologous 
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linker amino acid sequence, linking the A and B chains wherein the linker 
sequence contains the cleavage recognition site for the protease and 
suspending the protein in a pharmaceutically acceptable carrier, diluent or 
excipient. 

5 In a further aspect, the invention provides a pharmaceutical 

composition for treating for treating a mammal with disease wherein cells 
affected by the disease are associated with a disease specific protease, 
including cancer or infection with a virus, fungus, or a parasite comprising 
the recombinant protein of the invention and a pharmaceutically 
10 acceptable carrier, diluent or excipient. 

Other features and advantages of the present invention will 
become apparent from the following detailed description. It should be 
understood, however, that the detailed description and the specific 
examples while indicating preferred embodiments of the invention are 
15 given by way of illustration only, since various changes and modifications 
within the spirit and scope of the invention will become apparent to those 
skilled in the art from this detailed description. 
DESCRIPTION OF THE DRAWINGS 

The invention will be better understood with reference to 
20 the drawings in which: 

Figure 1 shows the DNA sequence of the baculovirus 
transfer vector, pVL1393; 

Figure 2A summarizes the cloning strategy used to 
generate the pAP-213 construct; 
25 Figure 2B shows the nucleotide sequence of the Cathepsin B 

linker regions of pAP-213; 

Figure 2C shows the subcloning of the Cathepsin B linker 
variant into a baculovirus transfer vector; 

Figure 2D shows the DNA sequence of the pAP-214 insert 
30 containing ricin and the Cathepsin B linker; 

Figure 3A summarizes the cloning strategy used to 
generate the pAP-215 construct; 

Figure 3B shows the nucleotide sequence of the MMP-3 
linker regions of p AP-215; 



-16- 



Figure 3C shows the subcloning of the MMP-3 linker 
variant into a baculo virus transfer vector; 

Figure 3D shows the DNA sequence of the pAP-216 insert 
containing ricin and the MMP-3 linker; 
5 Figure 4A summarizes the cloning strategy used to 

generate the pAP-217 construct; 

Figure 4B shows the nucleotide sequence of the MMP-7 
linker regions of pAP-217; 

Figure 4C shows the subcloning of the MMP-7 linker 
10 variant into a baculovirus transfer vector; 

Figure 4D shows the DNA sequence of the pAP-218 insert 
containing ricin and the MMP-7 linker; 

Figure 5A summarizes the cloning strategy used to 
generate the pAP-219 construct; 
15 Figure 5B shows the nucleotide sequence of the MMP-9 

linker regions of pAP-219; 

Figure 5C shows the subcloning of the MMP-9 linker 
variant into a baculovirus transfer vector; 

Figure 5D shows the DNA sequence of the pAP-220 insert 
20 containing ricin and the MMP-9 linker. 

Figure 6A summarizes the cloning strategy used to 
generate the pAP-221 construct; 

Figure 6B shows the nucleotide sequence of the 
thermolysin-like MMP linker regions of pAP-221; 
25 Figure 6C shows the subcloning of the thermolysin-like 

MMP linker variant into a baculovirus transfer vector. 

Figure 6D shows the DNA sequence of the pAP-222 insert 
containing ricin and the thermolysin-like MMP linker; 

Figure 7A summarizes the cloning strategy used to 
30 generate the pAP-223 construct; 

Figure 7B shows the nucleotide sequence of the 
Plasmodium f alciparum-A linker regions of p AP-223; 

Figure 7C shows the subcloning of the Plasmodium 
falciparum-A linker variant into a baculovirus transfer vector; 
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Figure 7D shows the DNA sequence of the pAP-224 insert 
containing ricin and the Plasmodium falciparum-A linker; 

Figure 8A summarizes the cloning strategy used to 
generate the pAP-225 construct; 
5 Figure 8B shows the nucleotide sequence of the 

Plasmodium f alciparum-B linker regions of p AP-225; 

Figure 8C shows the subcloning of the Plasmodium 
f alciparum-B linker variant into a baculovirus transfer vector; 

Figure 8D shows the DNA sequence of the pAP-226 insert 
10 containing ricin and the Plasmodium f alciparum-B linker; 

Figure 9A summarizes the cloning strategy used to 
generate the pAP-227 construct; 

Figure 9B shows the nucleotide sequence of the 
Plasmodium falciparum-C linker regions of pAP-227; 
15 Figure 9C shows the subcloning of the Plasmodium 

falciparum-C linker variant into a baculovirus transfer vector; 

Figure 9D shows the DNA sequence of the pAP-228 insert 
containing ricin and the Plasmodium falciparum-C linker; 

Figure 10A summarizes the cloning strategy used to 
20 generate the pAP-229 construct; 

Figure 10B shows the nucleotide sequence of the 
Plasmodium falciparum-D linker regions of pAP-229; 

Figure 10C shows the subcloning of the Plasmodium 
falciparum-D linker variant into a baculovirus transfer vector; 
25 Figure 10D shows the DNA sequence of the pAP-230 insert 

containing ricin and the Plasmodium falciparum-D linker; 

Figure 11 A summarizes the cloning strategy used to 
generate the pAP-231 construct; 

Figure 11B shows the nucleotide sequence of the 
30 Plasmodium falciparum-E linker regions of pAP-231; 

Figure 11C shows the subcloning of the Plasmodium 
falciparum-E linker variant into a baculovirus transfer vector- 
Figure 11D shows the DNA sequence of the pAP-232 insert 
containing ricin and the Plasmodium falciparum-E linker; 
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Figure 12A summarizes the cloning strategy used to 
generate the p AP-233 construct; 

Figure 12B shows the nucleotide sequence of the HSV-A 
linker regions of p AP-233; 
5 Figure 12C shows the subcloning of the HSV-A linker 

variant into a baculovirus transfer vector; 

Figure 12D shows the DNA sequence of the pAP-234 insert 
containing ricin and the HSV-A linker; 

Figure 13A summarizes the cloning strategy used to 
10 generate the pAP-235 construct; 

Figure 13B shows the nucleotide sequence of the HSV-B 
linker regions of p AP-235; 

Figure 13C shows the subcloning of the HSV-B linker 
variant into a baculovirus transfer vector; 
15 Figure 13D shows the DNA sequence of the pAP-236 insert 

containing ricin and the HSV-B linker; 

Figure 14A summarizes the cloning strategy used to 
generate the pAP-237 construct; 

Figure 14B shows the nucleotide sequence of the VZV-A 
20 linker regions of pAP-237; 

Figure 14C shows the subcloning of the VZV-A linker 
variant into a baculovirus transfer vector; 

Figure 14D shows the DNA sequence of the pAP-238 insert 
containing ricin and the VZV-A linker; 
25 Figure 15 A summarizes the cloning strategy used to 

generate the p AP-239 construct; 

Figure 15B shows the nucleotide sequence of the VZV-B 
linker regions of p AP-239; 

Figure 15C shows the subcloning of the VZV-B linker 
30 variant into a baculovirus transfer vector; 

Figure 15D shows the DNA sequence of the pAP-240 insert 
containing ricin and the VZV-B linker; 

Figure 16A summarizes the cloning strategy used to 
generate the p AP-241 construct; 
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Figure 16B shows the nucleotide sequence of the EBV-A 
linker regions of pAP-241; 

Figure 16C shows the subcloning of the EBV-A linker 
variant into a baculovirus transfer vector; 
5 Figure 16D shows the DNA sequence of the pAP-242 insert 

containing ricin and the EBV-A linker; 

Figure 17A summarizes the cloning strategy used to 
generate the pAP-243 construct; 

Figure 17B shows the nucleotide sequence of the EBV-B 
10 linker regions of pAP-243; 

Figure 17C shows the subcloning of the EBV-B linker 
variant into a baculovirus transfer vector; 

Figure 17D shows the DNA sequence of the pAP-244 insert 
containing ricin and the EBV-B linker; 
15 Figure 18A summarizes the cloning strategy used to 

generate the p AP-245 construct; 

Figure 18B shows the nucleotide sequence of the CMV-A 
linker regions of p AP-245; 

Figure 18C shows the subcloning of the CMV-A linker 
20 variant into a baculovirus transfer vector; 

Figure 18D shows the DNA sequence of the pAP-246 insert 
containing ricin and the CMV-A linker; 

Figure 19A summarizes the cloning strategy used to 
generate the pAP-247 construct; 
25 Figure 19B shows the nucleotide sequence of the CMV-B 

linker regions of pAP-247; 

Figure 19C shows the subcloning of the CMV-B linker 
variant into a baculovirus transfer vector; 

Figure 19D shows the DNA sequence of the pAP-248 insert 
30 containing ricin and the CMV-B linker. 

Figure 20A summarizes the cloning strategy used to 
generate the pAP-249 construct; 

Figure 20B shows the nucleotide sequence of the HHV-6 
linker regions of pAP-249; 
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Figure 20C shows the subcloning of the HHV-6 linker 
variant into a baculovirus transfer vector; 

Figure 20D shows the DNA sequence of the pAP-250 insert 
containing ricin and the HHV-6 linker; 

Figure 21 shows the amino acid sequences of the wild type 
ricin linker and cancer protease-sensitive amino acid linkers contained in 
pAP-213 to pAP-222 and linkers pAP-241 to pAP-244; 

Figure 22A summarizes the cloning strategy used to 
generate the pAP-253 construct; 

Figure 22B shows the nucleotide sequence of the ILV linker 
regions of pAP-253; 

Figure 22C shows the subcloning of the ILV linker variant 
into a baculovirus transfer vector; 

Figure 22D shows the DNA sequence of the pAP-254 insert 
containing ricin and the ILV linker; 

Figure 23A summarizes the cloning strategy used to 
generate the pAP-257 construct; 

Figure 23B shows the nucleotide sequence of the HAV-A 
linker regions of p AP-257; 

Figure 23C shows the subcloning of the HAV-A linker 
variant into a baculovirus transfer vector; 

Figure 23D shows the DNA sequence of the pAP-258 insert 
containing ricin and the HAV-A linker; 

Figure 24A summarizes the cloning strategy used to 
generate the pAP-255 construct; 

Figure 24B shows the nucleotide sequence of the HAV-B 
linker regions of p AP-255; 

Figure 24C shows the subcloning of the HAV-B linker 
variant into a baculovirus transfer vector; 

Figure 24D shows the DNA sequence of the pAP-256 insert 
containing ricin and the HAV-B linker; 

Figure 25A summarizes the cloning strategy used to 
generate the pAP-259 construct; 
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Figure 25B shows the nucleotide sequence of the CAN 
linker regions of p AP-259; 

Figure 25C shows the subcloning of the CAN linker variant 
into a baculovirus transfer vector; 
5 Figure 25D shows the DNA sequence of the pAP-260 insert 

containing ricin and the CAN linker; 

Figure 26 shows the amino acid sequences of the wild type 
ricin linker and Plasmodium falciparum protease-sensitive amino acid linkers 
contained in linkers pAP-223 to p AP-232; 
10 Figure 27 shows the amino acid sequences of the wild type 

ricin linker and the viral protease-sensitive amino acid linkers contained in 
pAP-233 to pAP-240, p AP-245-pAP-248, pAP-253 to pAP-258; 

Figure 28 shows the amino acid sequences of the wild type 
ricin linker and the Candida aspartic protease-sensitive amino acid linker 
15 contained in pAP-259 to pAP-264; 

Figure 29 describes an alternative mutagenesis and 
subcloning strategy to provide a baculovirus transfer vector containing the 
ricin-like toxin variant gene; and 

Figure 30A summarizes the cloning strategy used to 
20 generate the pAP-262 construct; 

Figure 30B shows the nucleotide sequence of the HCV-A 
linker region of p AP-262; 

Figure 30C shows the DNA sequence of the p AP-262 insert; 

Figure 30D shows the amino acid sequence comparison of 
25 mutant preproricin linker region HCV-A to wild type; 

Figure 31A summarizes the cloning strategy used to 
generate the p AP-264 construct; 

Figure 31B shows the nucleotide sequence of the HCV-B 
linker region of p AP-264; 
30 Figure 31C shows the DNA sequence of the p AP-264 insert; 

Figure 31D shows the amino acid sequence comparison of 
mutant preproricin linker region HCV-B to wild type; 

Figure 32A summarizes the cloning strategy used to 
generate the pAP-266 construct; 
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Figure 32B shows the nucleotide sequence of the HCV-C 
linker region of pAP-266; 

Figure 32C shows the DNA sequence of the p AP-266 insert; 

Figure 32D shows the amino acid sequence comparison of 
5 mutant preproricin linker region HCV-C to wild type; 

Figure 33A summarizes the cloning strategy used to 
generate the pAP-268 construct; 

Figure 33B shows the nucleotide sequence of the HCV-D 
linker region of pAP-268; 
10 Figure 33C shows the DNA sequence of the pAP-268 insert; 

Figure 33D shows the amino acid sequence comparison of 
mutant preproricin linker region HCV-D to wild type; 

Figure 34A summarizes the cloning strategy used to 
generate the p AP-270 construct; 
15 Figure 34B shows the nucleotide sequence of the MMP-2 

linker region of pAP-270; 

Figure 34C shows the DNA sequence of the pAP-270 insert; 

Figure 34D shows the amino acid sequence comparison of 
mutant preproricin linker region of MMP-2 to wild type; 
20 Figure 35A summarizes the cloning strategy used to 

generate the pAP-272 construct; 

Figure 35B shows the nucleotide sequence of the Cathepsin 
B (Site 2) linker region of pAP-272; 

Figure 35C shows the DNA sequence of the pAP-272 insert; 
25 Figure 35D shows the amino acid sequence comparison of 

mutant preproricin linker region of Cathepsin B (Site 2) to wild type; 

Figure 36A summarizes the cloning strategy used to 
generate the pAP-274 construct; 

Figure 36B shows the nucleotide sequence of the Cathepsin 
30 L linker region of pAP-274; 

Figure 36C shows the DNA sequence of the pAP-274 insert; 

Figure 36D shows the amino acid sequence comparison of 
mutant preproricin linker region of Cathepsin L to wild type; 
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Figure 37A summarizes the cloning strategy used to 
generate the pAP-276 construct; 

Figure 37B shows the nucleotide sequence of the Cathepsin 
D linker region of p AP-276; 
5 Figure 37C shows the DNA sequence of the pAP-276 insert; 

Figure 37D shows the amino acid sequence comparison of 
mutant preproricin linker region of Cathepsin D to wild type; 

Figure 38A summarizes the cloning strategy used to 
generate the pAP-278 construct; 
10 Figure 38B shows the nucleotide sequence of the MMP-1 

linker region of pAP-278; 

Figure 38C shows the DNA sequence of the pAP-278 insert; 

Figure 38D shows the amino acid sequence comparison of 
mutant preproricin linker region of MMP-1 to wild type; 
15 Figure 39A summarizes the cloning strategy used to 

generate the pAP-280 construct; 

Figure 39B shows the nucleotide sequence of the Urokinase- 
Type Plasminogen Activator linker region of pAP-280; 

Figure 39C shows the DNA sequence of the pAP-280 insert; 
20 Figure 39D shows the amino acid sequence comparison of 

mutant preproricin linker region of Urokinase-Type Plasminogen 
Activator to wild type; 

Figure 40A summarizes the cloning strategy used to 
generate the pAP-282 construct; 
25 Figure 40B shows the nucleotide sequence of the MT-MMP 

linker region of p AP-282; 

Figure 40C shows the DNA sequence of the pAP-282 insert; 

Figure 40D shows the amino acid sequence comparison of 
mutant preproricin linker region of MT-MMP to wild type; 
30 Figure 41A summarizes the cloning strategy used to 

generate the pAP-284 construct; 

Figure 41B shows the nucleotide sequence of the MMP-1 1 
linker region of p AP-284; 

Figure 41C shows the DNA sequence of the p AP-284 insert; 
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Figure 41D shows the amino acid sequence comparison of 
mutant preproricin linker region of MMP-11 to wild type; 

Figure 42A summarizes the cloning strategy used to 
generate the pAP-286 construct; 
5 Figure 42B shows the nucleotide sequence of the MMP-13 

linker region of pAP-286; 

Figure 42C shows the DNA sequence of the pAP-286 insert; 

Figure 42D shows the amino acid sequence comparison of 
mutant preproricin linker region of MMP-13 to wild type; 
10 Figure 43A summarizes the cloning strategy used to 

generate the pAP-288 construct; 

Figure 43B shows the nucleotide sequence of the Tissue- 
type Plasminogen Activator linker region of pAP-288; 

Figure 43C shows the DNA sequence of the pAP-288 insert; 
15 Figure 43D shows the amino acid sequence comparison of 

mutant preproricin linker region of Tissue-type Plasminogen Activator to 
wild type; 

Figure 44A summarizes the cloning strategy used to 
generate the pAP-290 construct; 
20 Figure 44B shows the nucleotide sequence of the human 

Prostate-Specific Antigen linker region of pAP-290; 

Figure 44C shows the DNA sequence of the pAP-290 insert; 
Figure 44D shows the amino acid sequence comparison of 
mutant preproricin linker region of the human Prostate-Specific Antigen to 
25 wild type; 

Figure 45A summarizes the cloning strategy used to 
generate the pAP-292 construct; 

Figure 45B shows the nucleotide sequence of the kallikrein 
linker region of pAP-292; 
30 Figure 45C shows the DNA sequence of the pAP-292 insert; 

Figure 45D shows the amino acid sequence comparison of 
mutant preproricin linker region of the kallikrein to wild type; 

Figure 46A summarizes the cloning strategy used to 
generate the pAP-294 construct; 
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Figure 46B shows the nucleotide sequence of the neutrophil 
elastase linker region of pAP-294; 

Figure 46C shows the DNA sequence of the pAP-294 insert; 

Figure 46D shows the amino acid sequence comparison of 
5 mutant preproricin linker region of neutrophil elastase to wild type; 

Figure 47A summarizes the cloning strategy used to 
generate the pAP-296 construct; 

Figure 47B shows the nucleotide sequence of the calpain 
linker region of pAP-296; 
10 Figure 47C shows the DNA sequence of the pAP-296 insert; 

Figure 47D shows the amino acid sequence comparison of 
mutant preproricin linker region of calpain to wild type; 

Figure 48 is a blot showing cleavage of pAP-214 by 

Cathepsin B; 

15 Figure 49 is a blot showing cleavage of pAP-220 with 



MMP-9; 



20 HCMV. 



3C 



Figure 50 is a blot showing activation of pAP-214; and 

Figure 51 is a blot showing activation of pAP-220. 

Figure 52 is a blot showing cleavage of pAP-248 with 

Figure 53 is a blot showing activation of pAP-248- 

Figure 54 is a blot showing cleavage of pAP-256 by HAV 



Figure 55 is a blot showing activation of pAP-256. 
25 Figure 56 is a semi-logithmic graph illustrating the 

cytotoxicity to COS-1 cells of undigested pAP-214 and pAP-214 
digestedwith Cathepsin B. 

Figure 57 is a semi-logithmic graph illustrating the 
cytotoxicity of pAP-220 digested with MMP-9 compared to freshly 
30 thawed pAP-220 and ricin on COS-1 cells. 

Figure 58 is a blot showing cleavage of pAP-270 with 

MMP-2. 

Figure 59 is a blot showing activation of pAP-270. 
Figure 60 is a blot showing cleavage of p AP-288 by t-PA. 
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Figure 61 is a blot showing activation of pAP-288. 

Figure 62 is a blot showing cleavage of pAP-294 with 
human neutrophil elastase. 

Figure 63 is a blot showing activation of pAP-294. 
5 Figure 64 is a blot showing cleavage of pAP-296 with 

calpain. 

Figure 65 is a blot showing activation of pAP-296. 
Figure 66 is a blot showing cleavage of p AP-222 with 

MMP-2. 

10 Figure 67 is a blot showing activation of pAP-222. 

DETAILED DESCRIPTION OF THE INVENTION 
Nucleic Acid Molecules of the Invention 

As mentioned above, the present invention relates to novel 
nucleic acid molecules comprising a nucleotide sequence encoding an A 

15 chain of a ricin-like toxin, a B chain of a ricin- like toxin and a heterologous 
linker amino acid sequence, linking the A and B chains. The heterologous 
linker sequence contains a cleavage recognition site for a disease-specific 
protease (e.g. a viral protease, parasitic protease, cancer-associated 
protease, or a fungal protease). 

20 The term "isolated and purified" as used herein refers to a 

nucleic acid substantially free of cellular material or culture medium when 
produced by recombinant DNA techniques, or chemical precursors, or 
other chemicals when chemically synthesized. An "isolated and purified" 
nucleic acid is also substantially free of sequences which naturally flank the 

25 nucleic acid {i.e. sequences located at the 5 ! and 3' ends of the nucleic acid) 
from which the nucleic acid is derived. The term "nucleic acid" is intended 
to include DNA and RNA and can be either double stranded or single 
stranded. 

The term "linker sequence" as used herein refers to an 
30 internal amino acid sequence within the protein encoded by the nucleic 
acid molecule of the invention which contains residues linking the A and B 
chain so as to render the A chain incapable of exerting its toxic effect, for 
example catalytically inhibiting translation of a eukaryotic ribosome. By 
heterologous is meant that the linker sequence is not a sequence native to 
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the A or B chain of a ricin-like toxin or precursor thereof. However, 
preferably, the linker sequence may be of a similar length to the linker 
sequence of a ricin-like toxin and should not interfere with the role of the B 
chain in cell binding and transport into the cytoplasm. When the linker 
5 sequence is cleaved the A chain becomes active or toxic. 

The nucleic acid molecule of the invention is cloned by 
subjecting a preproricin cDNA clone to site-directed mutagenesis in order 
to generate a series of variants differing only in the sequence between the 
A and B chains (linker region). Oligonucleotides, corresponding to the 
10 extreme 5' and 3 T ends of the preproricin gene are synthesized and used to 
PCR amplify the gene. Using the cDNA sequence for preproricin (Lamb et 
al v Eur. J. Biochem. 145:266-270 (1985)), several oligonucleotide primers are 
designed to flank the start and stop codons of the preproricin open reading 
frame. 

15 The preproricin cDNA is amplified using the upstream 

primer Ricin-99 or Ricin-109 and the downstream primer Ricinl729C with 
Vent DNA polymerase (New England Biolabs) using standard procedures 
(Sambrook et aL, Molecular Cloning: A Laboratory Manual, Second 
Edition, (Cold Spring Harbor Laboratory Press, 1989)). The purified PCR 

20 fragment encoding the preproricin cDNA is then ligated into an Eco 
Rl-digested pBluescript II SK plasmid (Stratagene), and is used to 
transform competent XLl-Blue cells (Stratagene). The cloned PCR product 
containing the putative preproricin gene is confirmed by DNA sequencing 
of the entire cDNA clone . The sequences and location of oligonucleotide 

25 primers used for sequencing are shown in Table 1. 

The preproricin cDNA clone is subjected to site directed 
mutagenesis in order to generate a series of variants differing only in the 
sequence between the A and B chains (linker region). The wild-type 
preproricin linker region is replaced with the heterogenous linker 

30 sequences that are cleaved by the various disease-specific proteases as 
shown in Figures 21, 26, 27, 28, and Part D of Figures 30-47. Linker 
identification as used herein in connection with the sequences provided in 
these figures have been assigned the sequence ID numbers as discussed 
below. 
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The linker regions of the variants encode a cleavage 
recognition sequence for a disease-specific protease associated with for 
example, cancer, viruses, parasites, or fungiL The mutagenesis and cloning 
strategy used to generate the disease-specific protease-sensitive linker 
5 variants are summarized in Part A of Figures 2-20, and Part A of Figures 
22-25. The first step involves a DNA amplification using a set of mutagenic 
primers in combination with the two flanking primers Richin-99Eco or 
Ricin-109Eco and Ricinl729C Pst L Restriction digested PCR fragments are 
gel purified and then ligated with PBluescript SK which has been digested 
10 with Eco RI and Pst L Ligation reactions are used to transform competent 
XLl-Blue cells (Stratagene), Recombinant clones are identified by 
restriction digests of plasmid miniprep DNA and the mutant linker 
sequences are confirmed by DNA sequencing. With respect to the 
nucleotide sequences and amino acid sequences prepared as a result of the 
15 implementation of this strategy the following sequences have been 
assigned the sequence ID numbers as indicated. 

SEQ ID NO. 1 is used herein in connection with the DNA 
sequence of the baculovirus transfer vector, pVL1393. 

The nucleotide sequence of Cathepsin B linker regions of 
20 p AP-213 are referred to herein as SEQ ID NO. 2. 

The nucleotide sequence of Cathepsin B linker regions of 
pAP-214 are referred to herein as SEQ ID NO. 3. 

The nucleotide sequence of MMP-3 linker regions of pAP- 
215 are referred to herein as SEQ ID NO. 4. 
25 The DNA sequence of the pAP-216 insert containing ricin 

and the MMP-3 linker are referred to herein as SEQ ID NO. 5. 

The nucleotide sequence of MMP-7 linker regions of pAP- 
217 are referred to herein as SEQ ID NO. 6. 

The DNA sequence of the pAP-218 insert containing ricin 
30 and the MMP-7 linker are referred to herein as SEQ ID NO. 7. 

The nucleotide sequence of MMP-9 linker regions of pAP- 
219 are referred to herein as SEQ ID NO. 8. 

The DNA sequence of the pAP-220 insert containing ricin 
and the MMP-9 are referred to herein as SEQ ID NO. 9. 
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The nucleotide sequence of thermolysin-like MMP linker 
regions of pAP-221 are referred to herein as SEQ ID NO. 10. 

The DNA sequence of of p AP-222 insert containing ricin and 
the thermolysin-like MMP linker are referred to herein as SEQ ID NO. 11. 
5 The nucleotide sequence of Plasmodium falciparum-A linker 

regions of pAP-223 are referred to herein as SEQ ID NO. 12. 

The DNA sequence of the pAP-224 insert containing ricin 
and the Plasmodium falciparum-A linker are referred to herein as SEQ ID 
NO. 13. 

10 The nucleotide sequence of Plasmodium falciparum-B linker 

regions of pAP-225 are referred to herein as SEQ ID NO. 14. 

The DNA sequence of the pAP-226 insert containing ricin 
and the Plasmodium falciparum-B linker are referred to herein as SEQ ID 
NO. 15. 

15 The nucleotide sequence of Plasmodium falciparum-C linker 

regions of pAP-227 are referred to herein as SEQ ID NO. 16. 

The DNA sequence of the pAP-228 insert containing ricin 
and the Plasmodium falciparum-C linker are referred to herein as SEQ ID 
NO. 17. 

20 The nucleotide sequence of the the Plasmodium falciparum- 

D linker regions of pAP-229 is referred to herein as SEQ ID NO. 18. 

The DNA sequence of the pAP-230 insert containing ricin 
and the Plasmodium falciparum-D linker is referred to herein as SEQ ID 
NO. 19. 

25 The nucleotide sequence of the Plasmodium falciparum-E 

linker regions of pAP~231 is referred to herein as SEQ ID NO. 20. 

The DNA sequence of the pAP-232 insert containing ricin 
and the Plasmodium falciparum-E linker is referred to herein as SEQ ID 
NO. 21. 

30 The nucleotide sequence of the HSV-A linker regions of 

pAP-233 is referred to herein as SEQ ID NO. 22. 

The DNA sequence of the pAP-234 insert containing ricin 
and the HSV-A linker is referred to herein as SEQ ID NO. 23. 
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The nucleotide sequence of the HSV-B linker regions of 
p AP-235 is referred to herein as SEQ ID NO. 24. 

The DNA sequence of the pAP-236 insert containing ricin 
and the HSV-B linker is referred to herein as SEQ ID NO. 25. 
5 The nucleotide sequence of the VZV-A linker regions of 

pAP-237 are referred to herein as SEQ ID NO. 26. 

The DNA sequence of the pAP-238 insert containing ricin 
and the VZV-A linker are referred to herein as SEQ ID NO. 27. 

The nucleotide sequence of the VZV-B linker regions of 
10 PAP-239 is referred to herein as SEQ ID NO. 28. 

The DNA sequence of the pAP-240 insert containing ricin 
and the VZV-B linker is referred to herein as SEQ ID NO. 29. 

The nucleotide sequence of the EBV-A linker regions of 
pAP-241 is referred to herein as SEQ ID NO. 30. 
15 The DNA sequence of the pAP-242 insert containing ricin 

and the EBV-A linker is referred to herein as SEQ ID NO. 31. 

The nucleotide sequence of the EBV-B linker regions of 
pAP-243 is referred to herein as SEQ ID NO. 32. 

The DNA sequence of the pAP-244 insert containing ricin 
20 and the EBV-B linker is referred to herein as SEQ ID NO. 33. 

The nucleotide sequence of the CMV-A linker regions of 
pAP-245 is referred to herein as SEQ ID NO. 34. 

The DNA sequence of the pAP-246 insert containing ricin 
and the CMV-A linker is referred to herein as SEQ ID NO. 35. 
25 The nucleotide sequence of the CMV-B linker regions of 

pAP-247 is referred to herein as SEQ ID NO. 36. 

The DNA sequence of the pAP-248 insert containing ricin 
and the CMV-B linker is referred to herein as SEQ ID NO. 37. 

The nucleotide sequence of the HHV-6 linker regions of 
30 pAP-249 is referred to herein as SEQ ID NO. 38. 

The DNA sequence of the pAP-250 insert containing ricin 
and the HHV-6 linker is referred to herein as SEQ ID NO. 39. 

The amino acid sequences of the cancer protease-sensitive 
amino acid linkers contained in the following pAP proteins have the 
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sequence ID numbers as indicated: pAP-213 and pAP-214 (SEQ ID NO. 40); 
pAP-215 and pAP-216 (SEQ ID NO. 41); pAP-217 and pAP-218; (SEQ ID NO. 
42); pAP-219 and pAP-220 (SEQ ID NO. 43); and pAP-221 and pAP-222 
(SEQ ID NO. 44). 

5 The amino acid sequences of the following cancer protease- 

sensitive linkers are referred to herein with the corresponding sequence ID 
numbers: pAP-241 and pAP-242 (SEQ ID NO. 45); and pAP-243 and pAP- 
244 (SEQ ID NO. 46). 

The nucleotide sequence of the ILV linker regions of pAP- 
10 253 is referred to herein as SEQ ID NO. 47. 

The DNA sequence of the pAP-254 insert containing ricin 
and the ILV linker is referred to herein as SEQ ID NO. 48. 

The nucleotide sequence of the HAV-A linker regions of 
pAP-257 is referred to herein as SEQ ID NO. 49. 
15 The DNA sequence of the pAP-258 insert containing ricin 

and HAV-A linker is referred to herein as SEQ ID NO. 50. 

The nucleotide sequence of the HAV-B linker regions of 
pAP-255 is referred to herein as SEQ ID NO. 51. 

The DNA sequence of the pAP-256 insert containing ricin 
20 and the HAV-B linker is referred to herein as SEQ ID NO. 52. 

The nucleotide sequence of the CAN linker regions of pAP- 
259 is referred to herein as SEQ ID NO. 53. 

The DNA sequence of the pAP-260 insert containing ricin 
and the CAN linker is referred to herein as SEQ ID NO. 54. 
25 The amino acid sequences of Plasmodium falciparum 

protease-sensitive linkers are referred to herein by the sequence ID 
numbers as follows: pAP-223 and pAP-224 (SEQ ID NO 55); pAP-225 and 
pAP-226 (SEQ ID NO 56); pAP-227 and pAP-228 (SEQ ID NO 57); p AP-229 
and pAP-230 (SEQ ID NO 58); and pAP-231 and pAP-232 (SEQ ID NO 59) 
30 (see Figure 26). 

The amino acid sequences of the viral protease-sensitive 
linkers which follow are referred to herein by the sequence ID numbers 
indicated: pAP-233 and pAP 234 (SEQ ID NO 60); pAP-235 and pAP-236 
(SEQ ID NO 61); and pAP-249 and pAP-250 (SEQ ID NO 62) (see Figure 27). 
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The amino acid sequences of the viral protease-sensitive 
linkers which follow are referred to herein by the sequence ID numbers 
indicated: pAP-245 and pAP-246 (SEQ ID NO 63) ; and pAP-247 and pAP- 
248 (SEQ ID NO 64) (see Figure 27). 
5 The amino acid sequences of the viral protease-sensitive 

linkers which follow are referred to herein by the sequence ID numbers 
indicated: pAP-237 and pAP-238 (SEQ ID NO 65); and pAP-239 and pAP- 
240 (SEQ ID NO 66); pAP-253 and pAP-254 (SEQ ID NO 67); pAP-255 and 
pAP-256 (SEQ ID NO 68); and pAP-257 and pAP-258 (SEQ ID NO 69) (see 
10 Figure 27). 

The amino acid sequences of the Candida aspartic protease- 
sensitive linkers are referred to herein by the sequence ID numbers 
indicated: pAP-259 and pAP-260 (SEQ ID NO 70); pAP-261 and pAP-262 
(SEQ ID NO 71); and pAP-263 and pAP-264 (SEQ ID NO 72 ), 

15 An alternative mutagenesis and cloning strategy that can be 

used to generate the disease-specific protease-sensitive linker variants is 
summarized in Figure 29. The first step of this method involves a DNA 
amplification using a set of mutagenic primers in combination with the two 
flanking primers Ricin-109Eco and Ricinl729Pst. Restriction digested PCR 

20 fragments (Eco RI and Pst I) are gel purified. Preproricin variants 
produced from this method can be subcloned directly into the baculovirus 
transfer vector digested with Eco RI and Pst I and intermediate ligation 
steps involving pBluescript SK and pSB2 are circumvented. The cloning 
strategies used to generate disease-specific protease-sensitive linker 

25 variants are summarized in Part A of Figures 30 to 47. With respect to the 
nucleotide sequences and amino acid sequences prepared as a result of the 
implementation of this strategy the following sequences have been 
assigned the sequence ID numbers as indicated. 

The nucleotide sequence of the HCV-A linker region of 

30 pAP-262 is referred to herein as SEQ ID NO. 73. 

The DNA sequence of the pAP-262 insert is referred to 
herein as SEQ ID NO. 74. 

The amino acid sequence of the mutant preproricin linker 
region for HCV-A, pAP-262, is referred to herein as SEQ ID NO. 75. 
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The nucleotide sequence of the HCV-B linker region of 
p AP-264 is referred to herein as SEQ ID NO. 76. 

The DNA sequence of the pAP-264 insert is referred to 
herein as SEQ ID NO. 77. 
5 The amino acid sequence of the mutant preproricin linker 

region for HCV-B, pAP-264, is referred to herein as SEQ ID NO. 78. 

The nucleotide sequence of the HCV-C linker region of 
pAP-266 is referred to herein as SEQ ID NO. 79. 

The DNA sequence of the pAP-266 insert is referred to 
10 herein as SEQ ID NO. 80. 

The amino acid sequence of the mutant preproricin linker 
region for HCV-C, pAP-266, is referred to herein as SEQ ID NO. 81. 

The nucleotide sequence of the HCV-D linker region of 
pAP-268 is referred to herein as SEQ ID NO. 82. 
15 The DNA sequence of the pAP-268 insert is referred to 

herein as SEQ ID NO. 83. 

The amino acid sequence of the mutant preproricin linker 
region for HCV-D , pAP-268, is referred to herein as SEQ ID NO. 84. 

The nucleotide sequence of the MMP-2 linker region of 
20 pAP-270 is referred to herein as SEQ ID NO. 85. 

The DNA sequence of the pAP-270 insert is referred to 
herein as SEQ ID NO. 86. 

The amino acid sequence of the mutant preproricin linker 
region for MMP-2, pAP-270, is referred to herein as SEQ ID NO. 87. 
25 The nucleotide acid sequence of the Cathepsin B (Site 2) 

linker region of pAP-272 is referred to herein as SEQ ID NO. 88. 

The DNA sequence of the pAP-272 insert is referred to 
herein as SEQ ID NO. 89. 

The amino acid sequence of the mutant preproricin linker 
30 region for Cathepsin B (Site 2), pAP-272, is referred to herein as SEQ ID 
NO. 90. 

The nucleotide sequence of the Cathepsin L linker region of 
pAP-274 is referred to herein as SEQ ID NO. 91. 
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The DNA sequence of the pAP-274 insert is referred to 
herein as SEQ ID NO. 92. 

The amino acid sequence of the mutant preproricin linker 
region of Cathepsin L, pAP-274, is referred to herein as SEQ ID NO. 93. 
5 The nucleotide sequence of Cathepsin D linker region of 

pAP-276 is referred to herein as SEQ ID NO. 94. 

The DNA sequence of the pAP-276 insert is referred to 
herein as SEQ ID NO. 95. 

The amino acid sequence of the mutant preproricin linker 
10 region for Cathepsin D, pAP-276, is referred to herein as SEQ ID NO. 96. 

The nucleotide sequence of the MMP-1 linker region of 
p AP-278 is referred to herein as SEQ ID NO. 97. 

The DNA sequence of the pAP-278 insert is referred to 
herein as SEQ ID NO. 98. 
15 The amino acid sequence of the mutant preproricin linker 

region for MMP-1, pAP-278, is referred to herein as SEQ ID NO. 99. 

The nucleotide sequence of the Urokinase-Type 
Plasminogen Activator linker region of pAP-280 is referred to herein as 
SEQ ID NO. 100. 

20 The DNA sequene of the pAP-280 insert is referred to 

herein as SEQ ID NO. 101. 

The amino acid sequence of the mutant preproricin linker 
region for Urokinase-Type Plasminogen Activator, pAP-280, is referred to 
herein as SEQ ID NO. 102. 
25 The nucleotide sequence of MT-MMP linker region of pAP- 

282 is referred to herein as SEQ ID NO. 103. 

The DNA sequence of the pAP-282 insert is referred to 
herein as SEQ ID NO. 104. 

The amino acid sequence of the mutant preproricin linker 
30 region for MT-MMP, pAP-282, is referred to herein as SEQ ID NO. 105. 

The nucleotide sequence of the MMP-1 1 linker region of 
pAP-284 is referred to herein as SEQ ID NO. 106. 

The DNA sequence of the pAP-284 insert is referred to 
herein as SEQ ID NO. 107. 
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The amino acid sequence of the mutant preproricin linker 
region for MMP-11, pAP-284, is referred to herein as SEQ ID NO. 108. 

The nucleotide sequence of the MMP-13 linker region of 
pAP-286 is referred to herein as SEQ ID NO. 109. 
5 The DNA sequence of the pAP-286 insert is referred to 

herein as SEQ ID NO. 110. 

The amino acid sequence of the mutant preproricin linker 
region for MMP-13, pAP-286, is referred to herein as SEQ ID NO. 111. 

The nucleotide sequence of the Tissue-type Plasminogen 
10 Activator linker region of pAP-288 is referred to herein as SEQ ID NO. 112. 

The DNA sequence of the pAP-288 insert is referred to 
herein as SEQ ID NO. 113. 

The amino acid sequence of the mutant preproricin linker 
region for Tissue-type Plasminogen Activator, pAP-288, is referred to 
15 herein as SEQ ID NO. 114. 

The nucleotide sequence of the human Prostate- Specific 
Antigen linker region of pAP-290 is referred to herein as SEQ ID NO. 115. 

The DNA sequence of the pAP-290 insert is referred to 
herein as SEQ ID NO. 116. 
20 The amino acid sequence of the mutant preproricin linker 

region for the human Prostate-Specific Antigen, pAP-290, is referred to 
herein as SEQ ID NO. 117. 

The nucleotide sequence of the kallikrein linker region of 
pAP-292 is referred to herein as SEQ ID NO. 118. 
25 The DNA sequence of the pAP-292 insert is referred to 

herein as SEQ ID NO. 119. 

The amino acid sequence of the mutant preproricin linker 
region for the kallikrein, pAP-292, is referred to herein as SEQ ID NO. 120. 

The nucleotide sequence of the neutrophil elastase linker 
30 region of pAP-294 is referred to herein as SEQ ID NO. 121. 

The DNA sequence of the pAP-294 insert is referred to 
herein as SEQ ID NO. 122. 
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The amino acid sequence of the mutant preproricin linker 
region for neutrophil elastase, pAP-294, is referred to herein as SEQ ID NO. 
123. 

The nucleotide sequence of the calpain linker region of pAP- 

5 296 is referred to herein as SEQ ID NO. 124. 

The DNA sequence of the pAP-296 insert is referred to 
herein as SEQ ID NO. 125. 

The amino acid sequence of the mutant preproricin linker 
region for calpain, pAP-296, is referred to herein as SEQ ID NO. 126. 

10 The amino acid sequence of the wild type linker region is 

referred to herein as SEQ ID NO. 127. 

The nucleic acid molecule of the invention has sequences 
encoding an A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a 
heterologous linker sequence containing a cleavage recognition site for a 

15 disease-specific protease. The nucleic acid may be expressed to provide a 
recombinant protein having an A chain of a ricin-like toxin, a B chain of a 
ricin-like toxin and a heterologous linker sequence containing a cleavage 
recognition site for a disease-specific protease. 

The nucleic acid molecule may comprise the A and /or B 

20 chain of ricin. The ricin gene has been cloned and sequenced, and the 
X-ray crystal structures of the A and B chains are published (Rutenber, E., 
et al. Proteins 10:240-250 (1991); Weston et aL, Mol Biol 244:410-422 (1994); 
Lamb and Lord, Eur, J. Biochem. 14:265 (1985); Hailing, K., et al., Nucleic 
Acids Res. 13:8019 (1985)). It will be appreciated that the invention includes 

25 nucleic acid molecules encoding truncations of A and B chains of ricin like 
proteins and analogs and homologs of A and B chains of ricin-like proteins 
and truncations thereof (i.e., ricin-like proteins), as described herein. It will 
further be appreciated that variant forms of the nucleic acid molecules of 
the invention which arise by alternative splicing of an mRNA 

30 corresponding to a cDNA of the invention are encompassed by the 
invention. 

Another aspect of the invention provides a nucleotide 
sequence which hybridizes under high stringency conditions to a 
nucleotide sequence encoding the A and /or B chains of a ricin-like protein. 
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Appropriate stringency conditions which promote DNA hybridization are 
known to those skilled in the art, or can be found in Current Protocols in 
Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1 6.3.6. For 
example, 6.0 x sodium chloride/sodium citrate (SSC) at about 45°C, 
5 followed by a wash of 2.0 x SSC at 50°C may be employed. The stringency 
may be selected based on the conditions used in the wash step. By way of 
example, the salt concentration in the wash step can be selected from a 
high stringency of about 0.2 x SSC at 50 °C. In addition, the temperature in 
the wash step can be at high stringency conditions, at about 65°C. 

10 The nucleic acid molecule may comprise the A and /or B 

chain of a ricin-like toxin. Methods for cloning ricin-like toxins are known 
in the art and are described, for example, in E.P. 466,222. Sequences 
encoding ricin or ricin-like A and B chains may be obtained by selective 
amplification of a coding region, using sets of degenerative primers or 

15 probes for selectively amplifying the coding region in a genomic or cDNA 
library. Appropriate primers may be selected from the nucleic acid 
sequence of A and B chains of ricin or ricin-like toxins. It is also possible to 
design synthetic oligonucleotide primers from the nucleotide sequences for 
use in PCR. Suitable primers may be selected from the sequences 

20 encoding regions of ricin-like proteins which are highly conserved, as 
described for example in U.S. Patent No 5,101,025 and E.P. 466,222. 

A nucleic acid can be amplified from cDNA or genomic 
DNA using these oligonucleotide primers and standard PCR amplification 
techniques. The nucleic acid so amplified can be cloned into an appropriate 

25 vector and characterized by DNA sequence analysis. It will be appreciated 
that cDNA may be prepared from mRNA, by isolating total cellular mRNA 
by a variety of techniques, for example, by using the 
guanidinium-thiocyanate extraction procedure of Chirgwin et al., 
Biochemistry 18, 5294-5299 (1979). cDNA is then synthesized from the 

30 mRNA using reverse transcriptase (for example, Moloney MLV reverse 
transcriptase available from Gibco/BRL, Bethesda, MD, or AMV reverse 
transcriptase available from Seikagaku America, Inc., St. Petersburg, FL). 
It will be appreciated that the methods described above may be used to 
obtain the coding sequence from plants, bacteria or fungi, preferably 
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plants, which produce known ricin-like proteins and also to screen for the 
presence of genes encoding as yet unknown ricin-like proteins. 

A sequence containing a cleavage recognition site for a 
specific protease may be selected based on the disease or the pathogen 
5 which is to be targeted by the recombinant protein. The cleavage 
recognition site may be selected from sequences known to encode a 
cleavage recognition site for the cancer, viral or parasitic protease. 
Sequences encoding cleavage recognition sites may be identified by testing 
the expression product of the sequence for susceptibility to cleavage by the 

10 respective protease. 

A sequence containing a cleavage recognition site for a viral, 
fungal, parasitic or cancer associated protease may be selected based on 
the retrovirus which is to be targeted by the recombinant protein. The 
cleavage recognition site may be selected from sequences known to 

15 encode a cleavage recognition site for the viral, fungal, parasitic or cancer 
associated protease. Sequences encoding cleavage recognition sites may 
be identified by testing the expression product of the sequence for 
susceptibility to cleavage by a viral, fungal, parasitic or cancer associated 
protease. A polypeptide containing the suspected cleavage recognition site 

20 may be incubated with a protease and the amount of cleavage product 
determined (Dilannit, 1990, J. Biol Chem. 285: 17345-17354 (1990)). 

The protease may be prepared by methods known in the 
art and used to test suspected cleavage recognition sites. 

In one embodiment, the preparation of tumour-associated 

25 cathepsin B, its substrates and enzymatic activity assay methodology have 
been described by Sloane, B.F. et al. (Proc. Natl Acad. Set USA 83:2483-2487 
(1986)), Schwartz, M.IC (Clin. Chitn. Acta 237:67-78 (1995)), and Panchal, 
R.G. et al. (Nature Biotechnol. 14:852-856 (1996)). The preparation of 
Epstein-Barr virus protease, its substrates and enzymatic activity assay 

30 methodology have been described by Welch, A.R. (Proc. Natl Acad. Sci. 
USA 88:10792-10796 (1991)). 

In another embodiment, the preparation of Plasmodium 
falciparum proteases, their substrates and enzymatic activity assay 
methodology have been described by Goldberg, D.E. et al. (/. Exp. Med. 
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173:961-969 (1991)), Cooper & Bujard (Mol Biochem. Parasitol. 56:151-160 
(1992)), Nwagwu, M. et aL (Exp. Parasitol 75:399-414 (1992)), Rosenthal, PJ. 
et al. (J. Clin. Invest. 91:1052-1056 (1993)), Blackman, M.J. et aL (Mol Biochem. 
Parasitol 62:103-114 (1995)). 
5 In a further embodiment, the preparation of proteases from 

human cytomegalovirus, human herpes virus, varicalla zoster virus and 
infectious laryngotracheitis virus have been taught by Liu F. & Roizman, B. 
(/. Virol 65:5149-5156 (1991)) and Welch, A.R. (Proc. Natl Acad. Sci. USA 
88:10792-10796 (1991)). In addition, their respective substrates and 

10 enzymatic activity assay methodologies are also described. 

In another embodiment, the preparation of hepatitis A virus 
protease, its substrates and enzymatic activity assay methodology have 
been described by Jewell, D.A. et al. (Biochemistry 31:7862-7869 (1992)). The 
preparation of poliovirus protease, its substrates and enzymatic activity 

15 assay methodology have been described by Weidner, J.R. et al. (Arch. 
Biochem. Biophys. 286:402-408 (1991)). The preparation of human rhinovirus 
protease, its substrates and enzymatic activity assay methodology have 
been described by Long, A.C. et al. (FEBS Lett 258:75-78 (1989)). 

In another embodiment of the invention, the preparation of 

20 proteases associated with Candida yeasts their substrates and enzymatic 
activity are contemplated, including the aspartic proteinases which have 
been associated specifically with numerous virulent strains of Candida 
including Candida albican, Candida tropicalis, and Candida parapsilosis (Abad- 
Zapatero, C. et aL, Protein Set 5:640-652 (1996); Cutfield, S.M. et aL, 

25 Biochemistry 35:398-410 (1995); Ruchel, R. et al, Zentralbl Bakteriol Mikrobiol 
Hyg. IAbt Orig. A. 255:537-548 (1983); Remold, H. et aL, Biochim. Biophys. 
Acta 167:399-406 (1968)). 

The nucleic acid molecule of the invention may be prepared 
by site directed mutagenesis. For example, the cleavage site of a disease- 

30 specific protease may be prepared by site directed mutagenesis of the 
homologous linker sequence of a proricin-like toxin. Procedures for 
cloning proricin-like genes, encoding a linker sequence are described in EP 
466,222. Site directed mutagenesis may be accomplished by DNA 
amplification of mutagenic primers in combination with flanking primers. 
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Suitable procedures using the mutagenic primers are shown in Parts A and 
B of Figures 1-4, Figures 13-16, Figures 18-36, Figures 38-41, and Figures 
50-67. 

The nucleic acid molecule of the invention may also encode 
5 a fusion protein, A sequence encoding a heterologous linker sequence 
containing a cleavage recognition site for a disease-specific protease may 
be cloned from a cDNA or genomic library or chemically synthesized 
based on the known sequence of such cleavage sites. The heterologous 
linker sequence may then be fused in frame with the sequences encoding 
10 the A and B chains of the ricin-like toxin for expression as a fusion protein, 
p It will be appreciated that a nucleic acid molecule encoding a fusion protein 

may contain a sequence encoding an A chain and a B chain from the same 

y ? 

Ul ricin-like toxin or the encoded A and B chains may be from different toxins. 

IT* For example, the A chain may be derived from ricin and the B chain may 

W 15 be derived from abrin. A protein may also be prepared by chemical 

; conjugation of the A and B chains and linker sequence using conventional 

□ coupling agents for covalent attachment. 

u An isolated and purified nucleic acid molecule of the 

f: invention which is RNA can be isolated by cloning a cDNA encoding an A 

5 20 and B chain and a linker into an appropriate vector which allows for 

transcription of the cDNA to produce an RNA molecule which encodes a 
protein of the invention. For example, a cDN A can be cloned downstream 
of a bacteriophage promoter, (e.g. a T7 promoter) in a vector, cDNA can 
be transcribed in vitro with T7 polymerase, and the resultant RNA can be 
25 isolated by standard techniques. 

Recombinant Protein of the Invention 

As previously mentioned, the invention provides novel 
recombinant proteins which incorporate the A and B chains of a ricin like 
toxin linked by a heterologous linker sequence containing a cleavage 
30 recognition site for a disease-specific protease. It is an advantage of the 
recombinant proteins of the invention that they are non-toxic until the A 
chain is liberated from the B chain by specific cleavage of the linker by the 
target protease. 
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Thus the protein may be used to specifically target cancer 
cells or cells infected with a virus or parasite in the absence of additional 
specific cell-binding components to target infected cells. It is a further 
advantage that the disease-specific protease cleaves the heterologous 
5 linker intracellularly thereby releasing the toxic A chain directly into the 
cytoplasm of the cancer cell or infected cell. As a result, said cells are 
specifically targeted and non-infected normal cells are not directly exposed 
to the activated free A chain. 

Ricin is a plant derived ribosome inhibiting protein which 
10 blocks protein synthesis in eukaryotic cells. Ricin may be derived from the 
q seeds of Ricinus communis (castor oil plant) .The ricin toxin is a glycosylated 

_% 5 heterodimer with A and B chain molecular masses of 30,625 Da and 31,431 

ifi Da respectively. The A chain of ricin has an N-glycosidase activity and 

f7 catalyzes the excision of a specific adenine residue from the 28S rRNA of 

ipsa- 

yl 15 eukaryotic ribosomes (Endo, Y; & Tsurugi, K. J. Biol. Chem. 262:8128 

(1987)). The B chain of ricin, although not toxic in itself, promotes the 
O toxicity of the A chain by binding to galactose residues on the surface of 

72 eukaryotic cells and stimulating receptor-mediated endocytosis of the toxin 

£ molecule (Simmons et al., Biol Chem, 261:7912 (1986)). 

S 20 All protein toxins are initially produced in an inactive, 

precursor form. Ricin is initially produced as a single polypeptide 
(preproricin) with a 35 amino acid N-terminal presequence and 12 amino 
acid linker between the A and B chains. The pre-sequence is removed 
during translocation of the ricin precursor into the endoplasmic reticulum 

25 (Lord, J.M., Eur. f. Biochem. 146:403-409 (1985) and Lord, J.M., Eur. /. 
Biochem. 146:411-416 (1985)). The proricin is then translocated into 
specialized organelles called protein bodies where a plant protease cleaves 
the protein at a linker region between the A and B chains (Lord, J.M. et al., 
FASAB Journal 8:201-208 (1994)). The two chains, however, remain 

30 covalently attached by an interchain disulfide bond (cysteine 259 in the A 
chain to cysteine 4 in the B chain) and mature disulfide linked ricin is stored 
in protein bodies inside plant cells. The A chain is inactive in the proricin 
(O'Hare, M v et al., FEBS Lett 273:200-204 (1990)) and it is inactive in the 
disulfide-linked mature ricin (Richardson, P.T. et al., FEBS Lett. 255:15-20 
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(1989))* The ribosomes of the castor bean plant are themselves susceptible 
to inactivation by ricin A chain; however, as there is no cell surface 
galactose to permit B chain recognition the A chain cannot re-enter the cell. 

Ricin-like proteins include, but are not limited to, bacterial, 
5 fungal and plant toxins which have A and B chains and inactivate 
ribosomes and inhibit protein synthesis. The A chain is an active 
polypeptide subunit which is responsible for the pharmacologic effect of 
the toxin. In most cases the active component of the A chain is an enzyme. 
The B chain is responsible for binding the toxin to the cell surface and is 
10 thought to facilitate entry of the A chain into the cell cytoplasm. The A and 

q B chains in the mature toxins are linked by disulfide bonds. The toxins 

most similar in structure to ricin are plant toxins which have one A chain 

yi and one B chain. Examples of such toxins include abrin which may be 

ff isolated from the seeds of Abrus precatorius, modeccin, volkensin and 

III 15 viscumin. 

p ^ Ricin-like bacterial proteins include diphtheria toxin, which 

Q is produced by Corynebacterium diphtheriae, Pseudomonas enterotoxin A 

and cholera toxin. It will be appreciated that the term ricin-like toxins is 
Hp also intended to include the A chain of those toxins which have only an A 

S 20 chain. The recombinant proteins of the invention could include the A chain 

of these toxins conjugated to, or expressed as, a recombinant protein with 
the B chain of another toxin. Examples of plant toxins having only an A 
chain include trichosanthin, MMC and pokeweed antiviral proteins, 
dianthin 30, dianthin 32, crotin II, curcin II and wheat germ inhibitor. 
25 Examples of fungal toxins having only an A chain include alpha-sarcin, 
restrictocin, mitogillin, enomycin, phenomycin. Examples of bacterial 
toxins having only an A chain include cytotoxin from Shigella dysenteriae 
and related Shiga-like toxins. Recombinant trichosanthin and the coding 
sequence thereof is disclosed in U.S. Patents 5,101,025 and 5,128,460. 
30 In addition to the entire A or B chains of a ricin-like toxin, it 

will be appreciated that the recombinant protein of the invention may 
contain only that portion of the A chain which is necessary for exerting its 
cytotoxic effect. For example, the first 30 amino acids of the ricin A chain 
may be removed resulting in a truncated A chain which retains toxic 
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activity. The truncated ricin or ricin-like A chain may be prepared by 
expression of a truncated gene or by proteolytic degradation, for example 
with Nagarase (Funmatsu et al., Jap. J. Med. Sci. Biol. 23:264-267 (1970)). 
Similarly, the recombinant protein of the invention may contain only that 
5 portion of the B chain necessary for galactose recognition, cell binding and 
transport into the cell cytoplasm. Truncated B chains are described for 
example in E.P. 145,111* The A and B chains may be glycosylated or 
non-glycosylated. Glycosylated A and B chains may be obtained by 
expression in the appropriate host cell capable of glycosylation. 

10 Non-glycosylated chains may be obtained by expression in 
nonglycosylating host cells or by treatment to remove or destroy the 
carbohydrate moieties. 

The proteins of the invention may be prepared using 
recombinant DNA methods. Accordingly, the nucleic acid molecules of the 

15 present invention may be incorporated in a known manner into an 
appropriate expression vector which ensures good expression of the 
protein. Possible expression vectors include but are not limited to cosmids, 
plasmids, or modified viruses (e.g. replication defective retroviruses, 
adenoviruses and adeno-associated viruses), so long as the vector is 

20 compatible with the host cell used. The expression vectors are "suitable for 
transformation of a host cell", which means that the expression vectors 
contain a nucleic acid molecule of the invention and regulatory sequences 
selected on the basis of the host cells to be used for expression, which is 
operatively linked to the nucleic acid molecule. Operatively linked is 

25 intended to mean that the nucleic acid is linked to regulatory sequences in 
a manner which allows expression of the nucleic acid. 

The invention therefore contemplates a recombinant 
expression vector of the invention containing a nucleic acid molecule of the 
invention, or a fragment thereof, and the necessary regulatory sequences 

30 for the transcription and translation of the inserted protein-sequence. 

Suitable regulatory sequences may be derived from a 
variety of sources, including bacterial, fungal, viral, mammalian, or insect 
genes (For example, see the regulatory sequences described in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, Academic 
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Press, San Diego, CA (1990). Selection of appropriate regulatory sequences 
is dependent on the host cell chosen as discussed below, and may be 
readily accomplished by one of ordinary skill in the art. Examples of such 
regulatory sequences include: a transcriptional promoter and enhancer or 
5 RNA polymerase binding sequence, a ribosomal binding sequence, 
including a translation initiation signal. Additionally, depending on the 
host cell chosen and the vector employed, other sequences, such as an 
origin of replication, additional DNA restriction sites, enhancers, and 
sequences conferring inducibility of transcription may be incorporated into 

10 the expression vector. It will also be appreciated that the necessary 
regulatory sequences may be supplied by the native A and B chains 
and/or its flanking regions. 

The recombinant expression vectors of the invention may 
also contain a selectable marker gene which facilitates the selection of host 

15 cells transformed or transfected with a recombinant molecule of the 
invention. Examples of selectable marker genes are genes encoding a 
protein such as G418 and hygromycin which confer resistance to certain 
drugs, p-galactosidase, chloramphenicol acetyltransferase, firefly luciferase, 
or an immunoglobulin or portion thereof such as the Fc portion of an 

20 immunoglobulin preferably IgG. Transcription of the selectable marker 
gene is monitored by changes in the concentration of the selectable marker 
protein such as p-galactosidase, chloramphenicol acetyltransferase, or 
firefly luciferase. If the selectable marker gene encodes a protein 
conferring antibiotic resistance such as neomycin resistance transformant 

25 cells can be selected with G418. Cells that have incorporated the selectable 
marker gene will survive, while the other cells die. This makes it possible 
to visualize and assay for expression of recombinant expression vectors of 
the invention and in particular to determine the effect of a mutation on 
expression and phenotype. It will be appreciated that selectable markers 

30 can be introduced on a separate vector from the nucleic acid of interest. 

The recombinant expression vectors may also contain genes 
which encode a fusion moiety which provides increased expression of the 
recombinant protein; increased solubility of the recombinant protein; and 
aid in the purification of the target recombinant protein by acting as a 
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ligand in affinity purification. For example, a proteolytic cleavage site may 
be added to the target recombinant protein to allow separation of the 
recombinant protein from the fusion moiety subsequent to purification of 
the fusion protein* Typical fusion expression vectors include pGEX (Amrad 
5 Corp., Melbourne, Australia), pMAL (New England Biolabs, Beverly, MA) 
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione 
S-transferase (GST), maltose E binding protein, or protein A, respectively, 
to the recombinant protein. 

Recombinant expression vectors can be introduced into host 

10 cells to produce a transformant host cell. The term "transformant host cell" 
is intended to include prokaryotic and eukaryotic cells which have been 
transformed or transfected with a recombinant expression vector of the 
invention. The terms "transformed with", "transfected with", 
"transformation" and "transfection" are intended to encompass 

15 introduction of nucleic acid (e.g, a vector) into a cell by one of many 
possible techniques known in the art. Prokaryotic cells can be transformed 
with nucleic acid by, for example, electroporation or calcium-chloride 
mediated transformation. Nucleic acid can be introduced into mammalian 
cells via conventional techniques such as calcium phosphate or calcium 

20 chloride co-precipitation, DEAE-dextran mediated transfection, lipofectin, 
electroporation or microinjection. Suitable methods for transforming and 
transfecting host cells can be found in Sambrook et al. (Molecular Cloning: 
A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory press 
(1989)), and other laboratory textbooks. 

25 Suitable host cells include a wide variety of prokaryotic and 

eukaryotic host cells. For example, the proteins of the invention may be 
expressed in bacterial cells such as E. coli, insect cells (using baculovirus), 
yeast cells or mammalian cells. Other suitable host cells can be found in 
Goeddel, Gene Expression Technology: Methods in Enzymology 185, 

30 Academic Press, San Diego, CA (1991). 

More particularly, bacterial host cells suitable for carrying 
out the present invention include E. coli, B. subtilis, Salmonella typhitnurium , 
and various species within the genus' Pseudomonas, Streptomyces, and 
Staphylococcus, as well as many other bacterial species well known to one 
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of ordinary skill in the art. Suitable bacterial expression vectors preferably 
comprise a promoter which functions in the host cell, one or more 
selectable phenotypic markers, and a bacterial origin of replication. 
Representative promoters include the (^-lactamase (penicillinase) and 
5 lactose promoter system (see Chang et al., Nature 275:615 (1978)), the trp 
promoter (Nichols and Yanofsky, Meth in Enzymology 101:155, (1983) and 
the tac promoter (Russell et al., Gene 20: 231, (1982)). Representative 
selectable markers include various antibiotic resistance markers such as the 
kanamycin or ampicillin resistance genes. Suitable expression vectors 
10 include but are not limited to bacteriophages such as lambda derivatives or 
plasmids such as P BR322 (Bolivar et al., Gene 2:9S, (1977)), the pUC 
plasmids P UC18, P UC19, P UC118, P UC119 (see Messing, Meth in 
Enzymology 101:20-77, 1983 and Vieira and Messing, Gene 19:259-268 
(1982)), and pNH8A, pNH16a, pNH18a, and Bluescript M13 (Stratagene, La 
15 Jolla, Calif.). Typical fusion expression vectors which may be used are 
discussed above, e.g. pGEX (Amrad Corp., Melbourne, Australia), pMAL 
(New England Biolabs, Beverly, MA) and pRITS (Pharmacia, Piscataway, 
NJ). Examples of inducible non-fusion expression vectors include pTrc 
(Amann et al., Gene 69:301-315 (1988)) and pET lid (Studier et al., Gene 
20 Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, California, 60-89 (1990)). 

Yeast and fungi host cells suitable for carrying out the 
present invention include, but are not limited to Saccharomyces cerevisae, the 
genera Pichia or Kluyveromyces and various species of the genus Aspergillus. 
25 Examples of vectors for expression in yeast S. cerivisae include pYepSecl 
(Baldari. et al., Embo J. 6:229-234 (1987)), pMFa (Kurjan and Herskowitz, 
Cell 30:933-943 (1982)), pJRY88 (Schultz et al., Gene 54:113-123 (1987)), and 
pYES2 (Invitrogen Corporation, San Diego, CA). Protocols for the 
transformation of yeast and fungi are well known to those of ordinary skill 
30 in the art.(see Hinnen et al., Proc. Natl. Acad. Sci. USA 75:1929 (1978); Itoh et 
al., /. Bacteriology 153:163 (1983), and Cullen et al. (Bio/Technology 5:369 
(1987)). 

Mammalian cells suitable for carrying out the present 
invention include, among others: COS (e.g., ATCC No. CRL 1650 or 1651), 
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BHK (e.g. ATCC No. CRL 6281), CHO (ATCC No. CCL 61), HeLa (e.g., 
ATCC No. CCL 2), 293 (ATCC No. 1573) and NS-1 cells. Suitable 
expression vectors for directing expression in mammalian cells generally 
include a promoter (e.g., derived from viral material such as polyoma, 
5 Adenovirus 2, cytomegalovirus and Simian Virus 40), as well as other 
transcriptional and translational control sequences. Examples of 
mammalian expression vectors include pCDM8 (Seed, B., Nature 329:840 
(1987)) and pMT2PC (Kaufman et al., EMBO J. 6:187-195 (1987)). 

Given the teachings provided herein, promoters, 

10 terminators, and methods for introducing expression vectors of an 
appropriate type into plant, avian, and insect cells may also be readily 
accomplished. For example, within one embodiment, the proteins of the 
invention may be expressed from plant cells (see Sinkar et al., J. Biosci 
(Bangalore) 11:47-58 (1987), which reviews the use of Agrobacterium 

15 rhizogenes vectors; see also Zambryski et al., Genetic Engineering, 
Principles and Methods, Hollaender and Setlow (eds.), Vol. VI, pp. 253-278, 
Plenum Press, New York (1984), which describes the use of expression 
vectors for plant cells, including, among others, pAS2022, pAS2023, and 
pAS2034). 

20 Insect cells suitable for carrying out the present invention 

include cells and cell lines from Bombyx, Trichoplusia or Spodotera species. 
Baculovirus vectors available for expression of proteins in cultured insect 
cells (SF 9 cells) include the pAc series (Smith et aL, Mol Cell Biol 3:2156- 
2165 (1983)) and the pVL series (Lucklow, V.A., and Summers, M.D., 

25 Virology 170:31-39 (1989)). Some baculovirus-insect cell expression systems 
suitable for expression of the recombinant proteins of the invention are 
described in PCT/US/02442. 

Alternatively, the proteins of the invention may also be 
expressed in non-human transgenic animals such as, rats, rabbits, sheep 

30 and pigs (Hammer et al. Nature 315:680-683 (1985); Palmiter et al. Science 
222:809-814 (1983); Brinster et al. Proc. Natl Acad. Sci. USA 82:4438-4442 
(1985); Palmiter and Brinster Cell 41:343-345 (1985) and U.S. Patent No. 
4,736,866). 
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The proteins of the invention may also be prepared by 
chemical synthesis using techniques well known in the chemistry of 
proteins such as solid phase synthesis (Merrifield, J. Am. Chem. Assoc. 
85:2149-2154 (1964)) or synthesis in homogenous solution (Houbenweyl, 
5 Methods of Organic Chemistry, ed. E. Wansch, Vol. 15 I and II, Thieme, 
Stuttgart (1987)). 

The present invention also provides proteins comprising an 
A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a 
heterologous linker amino acid sequence linking the A and B chains, 

10 wherein the linker sequence contains a cleavage recognition site for a 
disease-specific protease. Such a protein could be prepared other than by 
recombinant means, for example by chemical synthesis or by conjugation 
of A and B chains and a linker sequence isolated and purified from their 
natural plant, fungal or bacterial source. Such A and B chains could be 

15 prepared having the glycosylation pattern of the native ricin-like toxin. 

N-terminal or C-terminal fusion proteins comprising the 
protein of the invention conjugated with other molecules, such as proteins 
may be prepared by fusing, through recombinant techniques. The 
resultant fusion proteins contain a protein of the invention fused to the 

20 selected protein or marker protein as described herein. The recombinant 
protein of the invention may also be conjugated to other proteins by 
known techniques. For example, the proteins may be coupled using 
heterobifunctional thiol-containing linkers as described in WO 90/10457, 
N-succinimidyl-3-(2-pyridyldithio-proprionate) or N-succinimidyl-5 

25 thioacetate. Examples of proteins which may be used to prepare fusion 
proteins or conjugates include cell binding proteins such as 
immunoglobulins, hormones, growth factors, lectins, insulin, low density 
lipoprotein, glucagon, endorphins, transferrin, bombesin, 
asialoglycoprotein glutathione-S-transferase (GST), hemagglutinin (HA), 

30 and truncated myc. 

Utility of the Nucleic Acid Molecules and Proteins of the Invention 

The proteins of the invention may be used to specifically 
inhibit or destroy mammalian cells affected by a disease or infection which 
have associated with such cells a specific protease, i.e., disease-specific, for 
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example cancer cells or cells infected with a virus, fungus or parasite, all of 
which are encompased within the term "disease-specific/ 1 It is an 
advantage of the recombinant proteins of the invention that they have 
specificity for said cells without the need for a cell binding component. The 
5 ricin-like B chain of the recombinant proteins recognize galactose moieties 
on the cell surface and ensure that the protein is taken up by the diseased 
cell and released into the cytoplasm. When the protein is internalized into 
a non-infected cell, cleavage of the heterologous linker would not occur in 
the absence of the disease-specific protease and the A chain will remain 

10 inactive bound to the B chain. Conversely, when the protein is internalized 
into a diseased cell, the disease-specific protease will cleave the cleavage 
recognition site in the linker thereby releasing the toxic A chain. 

The specificity of a recombinant protein of the invention 
may be tested by treating the protein with the disease-specific protease 

15 which is thought to be specific for the cleavage recognition site of the 
linker and assaying for cleavage products. Disease-specific proteases may 
be isolated from cancer cells or infected cells, or they may be prepared 
recombinantly, for example following the procedures in Darket et al. (J. 
Biol Chem. 254:2307-2312 (1988)). The cleavage products may be identified 

20 for example based on size, antigenicity or activity. The toxicity of the 
recombinant protein may be investigated by subjecting the cleavage 
products to an in vitro translation assay in cell lysates, for example using 
Brome Mosaic Virus mRNA as a template. Toxicity of the cleavage 
products may be determined using a ribosomal inactivation assay (Westby 

25 et al., Bioconjugate Chem. 3:377-382 (1992)). The effect of the cleavage 
products on protein synthesis may be measured in standardized assays of 
in vitro translation utilizing partially defined cell free systems composed for 
example of a reticulocyte lysate preparation as a source of ribosomes and 
various essential cofactors, such as mRNA template and amino acids. Use 

30 of radiolabeled amino acids in the mixture allows quantitation of 
incorporation of free amino acid precursors into trichloroacetic acid 
precipitable proteins. Rabbit reticulocyte lysates may be conveniently 
used (O'Hare, FEBS Lett 273:200-204 (1990)). 



-50- 



The ability of the recombinant proteins of the invention to 
selectively inhibit or destroy animal cancer cells or cells infected with a 
virus or parasite may be readily tested in vitro using animal cancer cell lines 
or cell cultures infected with the virus or parasite of interest. The selective 
5 inhibitory effect of the recombinant proteins of the invention may be 
determined, for example, by demonstrating the selective inhibition of viral 
antigen expression in infected mammalian cells, the selective inhibition of 
general mRNA translation and protein synthesis in diseased cells, or 
selective inhibition of cellular proliferation in cancer cells or infected cells. 

10 Toxicity may also be measured based on cell viability, for 

example the viability of infected and non-infected cell cultures exposed to 
the recombinant protein may be compared. Cell viability may be assessed 
by known techniques, such as trypan blue exclusion assays. 

In another example, a number of models may be used to 

15 test the cytotoxicity of recombinant proteins having a heterologous linker 
sequence containing a cleavage recognition site for a cancer-associated 
matrix metalloprotease. Thompson, E.W. et al. (Breast Cancer Res. 
Treatment 31:357-370 (1994)) has described a model for the determination 
of invasiveness of human breast cancer cells in vitro by measuring tumour 

20 cell-mediated proteolysis of extracellular matrix and tumour cell invasion 
of reconstituted basement membrane (collagen, laminin, fibronectin, 
Matrigel or gelatin). Other applicable cancer cell models include cultured 
ovarian adenocarcinoma cells (Young, T.N. et al. Gynecol. Oncol 62:89-99 
(1996); Moore, D.H. et al. Gynecol Oncol 65:78-82 (1997)), human follicular 

25 thyroid cancer cells (Demeure, MJ. et al, World }. Surg. 16:770-776 (1992)), 
human melanoma (A-2058) and fibrosarcoma (HT-1080) cell lines (Mackay, 
A.R. et al. Lab. Invest 70:781-783 (1994)), and lung squamous (HS-24) and 
adenocarcinoma (SB-3) cell lines (Spiess, E. et al. J. Histochem. Cytochem. 
42:917-929 (1994)). An in vivo test system involving the implantation of 

30 tumours and measurement of tumour growth and metastasis in athymic 
nude mice has also been described (Thompson, E.W. et al., Breast Cancer 
Res. Treatment 31:357-370 (1994); Shi, Y.E. et al., Cancer Res. 53:1409-1415 
(1993)). 
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A further model may be used to test the cytotoxicity of 
recombinant proteins having a heterologous linker sequence containing a 
cleavage recognition site for a cancer-associated Cathepsin B protease is 
provided in human glioma (Mikkelsen, T. et al. J, Neurosurge, 83:285-290 
5 (1995)). 

Similarly, the cytotoxicity of recombinant proteins having a 
heterologous linker sequence containing a cleavage recognition site for a 
malarial protease may be tested by a Plasmodium invasion assay using 
human erythrocytes infected with mature-stage merozoite parasites as 
10 described by McPherson, R.A. et aL (MoL Biochern, ParasitoL 62:233-242 
(1993)). Alternatively, in vitro cultures of human hepatic parenchymal cells 
may be used to evaluate schizont infectivity and Plasmodium merozoite 
generation. 

With respect to models of viral infection and replication, 

15 suitable animal cells which can be cultured in vitro and which are capable of 
maintaining viral replication can be used as hosts. The toxicity of the 
recombinant protein for infected and non-infected cultures may then be 
compared. The ability of the recombinant protein of the invention to 
inhibit the expression of these viral antigens may be an important indicator 

20 of the ability of the protein to inhibit viral replication. Levels of these 
antigens may be measured in assays using labelled antibodies having 
specificity for the antigens. Inhibition of viral antigen expression has been 
correlated with inhibition of viral replication (U.S. Patent No. 4,869,903). 
Toxicity may also be assessed based on a decrease in protein synthesis in 

25 target cells, which may be measured by known techniques, such as 
incorporation of labelled amino acids, such as [3H] leucine (O'Hare et aL, 
FEBS Lett. 273:200-204 (1990)). Infected cells may also be pulsed with 
radiolabeled thymidine and incorporation of the radioactive label into 
cellular DNA may be taken as a measure of cellular proliferation. Toxicity 

30 may also be measured based on cell death or lysis, for example, the 
viability of infected and non-infected cell cultures exposed to the 
recombinant protein may be compared. Cell viability may be assessed by 
known techniques, such as trypan blue exclusion assays. 



-52- 



Although the primary specificity of the proteins of the 
invention for diseased cells is mediated by the specific cleavage of the 
cleavage recognition site of the linker, it will be appreciated that specific 
cell binding components may optionally be conjugated to the proteins of 
5 the invention. Such cell binding components may be expressed as fusion 
proteins with the proteins of the invention or the cell binding component 
may be physically or chemically coupled to the protein component. 
Examples of suitable cell binding components include antibodies to cancer, 
viral or parasitic proteins. 

10 Antibodies having specificity for a cell surface protein may 

be prepared by conventional methods. A mammal, (e.g. a mouse, 
hamster, or rabbit) can be immunized with an immunogenic form of the 
peptide which elicits an antibody response in the mammal. Techniques for 
conferring immunogenicity on a peptide include conjugation to carriers or 

15 other techniques well known in the art. For example, the peptide can be 
administered in the presence of adjuvant. The progress of immunization 
can be monitored by detection of antibody titers in plasma or serum. 
Standard ELISA or other immunoassay procedures can be used with the 
immunogen as antigen to assess the levels of antibodies. Following 

20 immunization, antisera can be obtained and, if desired, polyclonal 
antibodies isolated from the sera. 

To produce monoclonal antibodies, antibody producing 
cells (lymphocytes) can be harvested from an immunized animal and fused 
with myeloma cells by standard somatic cell fusion procedures thus 

25 immortalizing these cells and yielding hybridoma cells. Such techniques 
are well known in the art, (e.g. the hybridoma technique originally 
developed by Kohler and Milstein (Nature 256:495-497 (1975)) as well as 
other techniques such as the human B-cell hybridoma technique (Kozbor 
et al., Immunol Today 4:72 (1983)), the EBV-hybridoma technique to 

30 produce human monoclonal antibodies (Cole et al., Monoclonal Antibodies 
in Cancer Therapy Allen R., Bliss, Inc., pages 77-96 (1985)), and screening of 
combinatorial antibody libraries (Huse et al., Science 246:1275 (1989)). 
Hybridoma cells can be screened immunochemically for production of 
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antibodies specifically reactive with the peptide and the monoclonal 
antibodies can be isolated. 

The term "antibody" as used herein is intended to include 
fragments thereof which also specifically react with a cell surface 
5 component. Antibodies can be fragmented using conventional techniques 
and the fragments screened for utility in the same manner as described 
above. For example, F(ab')2 fragments can be generated by treating 
antibody with pepsin. The resulting F(ab')2 fragment can be treated to 
reduce disulfide bridges to produce Fab' fragments. 

10 Chimeric antibody derivatives, i.e., antibody molecules that 

combine a non-human animal variable region and a human constant 
region are also contemplated within the scope of the invention. Chimeric 
antibody molecules can include, for example, the antigen binding domain 
from an antibody of a mouse, rat, or other species, with human constant 

15 regions. Conventional methods may be used to make chimeric antibodies 
containing the immunoglobulin variable region which recognizes a cell 
surface antigen (See, for example, Morrison et aL, Proc. Natl Acad. Set 
U.S.A. 81:6851 (1985); Takeda et aL, Nature 314:452 (1985), Cabilly et aL, 
U.S. Patent No. 4,816,567; Boss et aL, U.S. Patent No. 4,816,397; Tanaguchi 

20 et aL, E.P. Patent No. 171,496; European Patent No. 173,494, United 
Kingdom Patent No. GB 2177096B). It is expected that chimeric antibodies 
would be less immunogenic in a human subject than the corresponding 
non-chimeric antibody. 

Monoclonal or chimeric antibodies specifically reactive 

25 against cell surface components can be further humanized by producing 
human constant region chimeras, in which parts of the variable regions, 
particularly the conserved framework regions of the antigen-binding 
domain, are of human origin and only the hypervariable regions are of 
non-human origin. Such immunoglobulin molecules may be made by 

30 techniques known in the art, (e.g. Teng et aL, Proc. Natl. Acad. Sci. U.S.A., 
80:7308-7312 (1983); Kozbor et aL, Immunology Today 4:7279 (1983); Olsson 
et aL, Meth. EnzymoL, 92:3-16 (1982), and PCT Publication WO92/06193 or 
EP 239,400). Humanized antibodies can also be commercially produced 
(Scotgen Limited, 2 Holly Road, Twickenham, Middlesex, Great Britain.) 
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Specific antibodies, or antibody fragments, reactive against 
cell surface components may also be generated by screening expression 
libraries encoding immunoglobulin genes, or portions thereof, expressed 
in bacteria with cell surface components. For example, complete Fab 
5 fragments, VH regions and FV regions can be expressed in bacteria using 
phage expression libraries (See for example Ward et aL, Nature 341:544-546 
(1989); Huse et aL, Science 246:1275-1281 (1989); and McCafferty et aL, 
Nature 348:552-554 (1990)). Alternatively, a SCID-hu mouse, for example 
the model developed by Genpharm, can be used to produce antibodies, or 

10 fragments thereof. 

The proteins of the invention may be formulated into 
pharmaceutical compositions for adminstration to subjects in a biologically 
compatible form suitable for administration in vivo. By "biologically 
compatible form suitable for administration in vivo 11 is meant a form of the 

15 substance to be administered in which any toxic effects are outweighed by 
the therapeutic effects. The substances may be administered to living 
organisms including humans, and animals. Administration of a 
therapeutically active amount of the pharmaceutical compositions of the 
present invention is defined as an amount effective, at dosages and for 

20 periods of time necessary to achieve the desired result. For example, a 
therapeutically active amount of a substance may vary according to factors 
such as the disease state, age, sex, and weight of the individual, and the 
ability of antibody to elicit a desired response in the individual. Dosage 
regime may be adjusted to provide the optimum therapeutic response. 

25 For example, several divided doses may be administered daily or the dose 
may be proportionally reduced as indicated by the exigencies of the 
therapeutic situation. 

The nucleic acid molecules of the invention may be 
formulated into pharmaceutical compositions for adminstration to subjects 

30 in a biologically compatible form suitable for administration in vivo. By 
"biologically compatible form suitable for administration in vivo" is meant 
a form of the substance to be administered in which any toxic effects are 
outweighed by the therapeutic effects. The substances may be 
administered to living organisms including humans, and animals. 



-55- 



Administration of a therapeutically active amount of the pharmaceutical 
compositions of the present invention is defined as an amount effective, at 
dosages and for periods of time necessary to achieve the desired result. 
For example, a therapeutically active amount of a substance may vary 
5 according to factors such as the disease state, age, sex, and weight of the 
individual, and the ability of antibody to elicit a desired response in the 
individual. Dosage regime may be adjusted to provide the optimum 
therapeutic response. For example, several divided doses may be 
administered daily or the dose may be proportionally reduced as indicated 

10 by the exigencies of the therapeutic situation. 

The active substance may be administered in a convenient 
manner such as by injection (subcutaneous, intravenous, intramuscular, 
etc.), oral administration, inhalation, transdermal administration (such as 
topical cream or ointment, etc.), or suppository applications. Depending 

15 on the route of administration, the active substance may be coated in a 
material to protect the compound from the action of enzymes, acids and 
other natural conditions which may inactivate the compound. 

The compositions described herein can be prepared by per 
se known methods for the preparation of pharmaceutically acceptable 

20 compositions which can be administered to subjects, such that an effective 
quantity of the active substance is combined in a mixture with a 
pharmaceutically acceptable vehicle. Suitable vehicles are described, for 
example, in Remington's Pharmaceutical Sciences (Remington's 
Pharmaceutical Sciences, Mack Publishing Company, Easton, Pa., USA 

25 1985). On this basis, the compositions include, albeit not exclusively, 
solutions of the substances in association with one or more 
pharmaceutically acceptable vehicles or diluents, and contained in buffered 
solutions with a suitable pH and iso-osmotic with the physiological fluids. 

The pharmaceutical compositions may be used in methods 

30 for treating animals, including mammals, preferably humans, with cancer 
or infected with a virus or a parasite. It is anticipated that the compositions 
will be particularly useful for treating patients with B-cell 
lymphoproliferative disease, (melanoma), mononucleosis, cytomegalic 
inclusion disease, malaria, herpes, shingles, hepatitis, poliomyelitis, or 
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infectious laryngotracheitis. The dosage and type of recombinant protein 
to be administered will depend on a variety of factors which may be 
readily monitored in human subjects. Such factors include the etiology and 
severity (grade and stage) of neoplasia, the stage of malarial infection (e.g. 
5 exoerythrocytic vs. erythrocytic), or antigen levels associated with viral 
load in patient tissues or circulation. 

As mentioned above, the novel recombinant toxic proteins 
and nucleic acid molecules of the present invention are useful in treating 
cancerous or infected cells wherein the cells contain a specific protease that 

10 can cleave the linker region of the recombinant toxic protein. One skilled 
in the art can appreciate that many different recombinant toxic proteins 
can be prepared once a disease associated protease has been identified. 
For example,the novel recombinant toxic proteins and nucleic acid 
molecules of the invention may be used to treat CNS tumors. Muller et al. 

15 (1993) describe increased activity of Insulin-type Growth Factor Binding 
Protein-3 (IGFBP-3) protease in the Cerebral Spinal Fluid of patients with 
CNS tumors. Cohen et al. (1992) claim that prostate-specific antigen (PSA) 
is an IGFBP-3 protease. The pAP290 construct described above is a 
substrate for PSA. Conover et al. (1994) claim that cathepsin D is IGFBP-3 

20 protease. The pAP276 described herein is a substrate for cathepsin D. 
Another example of a specific use of the invention is treatment of human 
glioma which has been shown to produce cathepsin D (Mikkelsen, T. et al. 
J. Neurosurge, 83:285-290 (1995)). The pAP 214 and 272 define herein are 
substrates for cathepsin B. 

25 In addition, the novel proteins and nucleic acid molecules of 

the present invention may be used to treat cystic fibrosis. Hansen et al. 
(1995) describe how CF airway disease is characterized by neutrophil- 
dominated chronic inflammation with an excess of uninhibited neutrophil 
elastase (NE). NE levels in CF sputum are 350 times higher than that 

30 found in normal sputum. The pAP294 described herein is a substrate for 
neutrophil elastase. 

As well, the novel proteins and nucleic acid molecules of the 
present invention may also be used to treat multiple sclerosis. Bever Jr. et 
al. (1994) implicate cathepsin B (possibly from inflammatory cells of 
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hematogenous origin) in the demyelination found in multiple sclerosis. 
pAPs 214 and 272 defined herein present substrates for cathepsin B. 

The term "animal" as used herein includes all members of 
the animal kingdom including mammals, preferably humans. 
5 The following non-limiting examples are illustrative of the 

present invention: 

EXAMPLES 

Example 1 

Cloning and Expression of Froricin Variants Activated by Disease- 
10 Specific Proteases 

O Isolation of total RNA 

jji The preproricin gene was cloned from new foliage of the 

castor bean plant. Total messenger RNA was isolated according to 
y, established procedures (Sambrook et al., Molecular Cloning: A Lab Manual 

}J 15 (Cold Spring Harbour Press, Cold Spring Harbour, (1989)) and cDNA 

s generated using reverse transcriptase. 

~£ cDNA Synthesis : 

M Oligonucleotides, corresponding to the extreme 5' and 3 f 

S ends of the preproricin gene were synthesized and used to PCR amplify 

O 20 the gene. Using the cDNA sequence for preproricin (Lamb et aL, Eur. J. 

Biochem., 145:266-270, 1985), several oligonucleotide primers were 
designed to flank the start and stop codons of the preproricin open reading 
frame. The oligonucleotides were synthesized using an Applied 
Biosystems Model 392 DNA/RNA Synthesizer. First strand cDNA 
25 synthesis was primed using the oligonucleotide Ricinl729C (Table 1). 
Three micrograms of total RNA was used as a template for oligo 
Ricinl729C primed synthesis of cDNA using Superscript II Reverse 
Transcriptase (BRL) following the manufacturer's protocol. 
DNA Amplification and Cloning 
30 The first strand cDNA synthesis reaction was used as 

template for DNA amplification by the polymerase chain reaction (PCR). 
The preproricin cDNA was amplified using the upstream primer Ricin-99 
and the downstream primer Ricinl729C with Vent DNA polymerase (New 
England Biolabs) using standard procedures (Sambrook et al v Molecular 



-58- 



Cloning; A Laboratory Manual, Second Edition, (Cold Spring Harbor 
Laboratory Press, 1989)). Amplification was carried out in a Biometra 
thermal cycler (TRIO-Thermalcycler) using the following cycling 
parameters: denaturation 95°C for 1 min., annealing 52°C for 1 min., and 
5 extension 72°C for 2 min., (33 cycles), followed by a final extension cycle at 
72°C for 10 min. The 1846bp amplified product was fractionated on an 
agarose gel (Sambrook et al v Molecular Cloning: A Laboratory Manual, 
Second Edition, (Cold Spring Harbor Laboratory Press, 1989), and the 
DNA purified from the gel slice using Qiaex resin (Qiagen) following the 

10 manufacturer's protocol. The purified PCR fragment encoding the 
preproricin cDNA was then ligated (Sambrook et aL, Molecular Cloning: 
A Laboratory Manual, Second Edition, (Cold Spring Harbor Laboratory 
Press, 1989)) into an Eco RV-digested pBluescript II SK plasmid 
(Stratagene), and used to transform competent XLl-Blue cells (Stratagene). 

15 Positive clones were confirmed by restriction digestion of purified plasmid 
DNA. Plasmid DNA was extracted using a Qiaprep Spin Plasmid Miniprep 
Kit (Qiagen). 
DNA Sequencing 

The cloned PCR product containing the putative preproricin 

20 gene was confirmed by DNA sequencing of the entire cDNA clone (pAP- 
144). Sequencing was performed using an Applied Biosystems 373 A 
Automated DNA Sequencer, and confirmed by double-stranded dideoxy 
sequencing by the Sanger method using the Sequenase kit (USB). The 
oligonucleotide primers used for sequencing were as follows: Ricin267, 

25 Ricin486, Ricin725, Ricin937, RicinllSl, Ricinil399, Ricinl627, T3 primer 
(5'AATTAACCCTCACTAAAGGG-3') (SEQ ID NO. 128) and T7 primer 
(5'GTAATACGACTCACTATAGGGC-3) (SEQ ID NO. 129). Sequence data 
was compiled and analyzed using PC Gene software package 
(intelligenetics). The sequences and location of oligonucleotide primers is 

30 shown in Table 1. The oligonucleotide primers shown in Table 1 have been 
assigned the following sequence ID numbers: 
Ricin-109 is referred to herein as SEQ ID NO. 130; 
Ricin-99Eco is referred to herein as SEQ ID NO. 131; 
Ricin267 is referred to herein as SEQ ID NO. 132; 
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Ricin486 is referred to herein as SEQ ID NO. 133; 
Ricin725 is referred to herein as SEQ ID NO. 134; 
Ricin 937 is referred to herein as SEQ ID NO. 135; 
Ricin 1151 is referred to herein as SEQ ID NO. 136; 
5 Ricin 1399 is referred to herein as SEQ ID NO. 137; 
Ricin 1627 is referred to herein as SEQ ID NO. 138; 
Ricin 1729C is referred to herein as SEQ ID NO. 139; and 
Ricin 1729C Xba is referred to herein as SEQ ID NO. 140. 
Production and Cloning of Linker Variants 

10 pAP144 cut with EcoRI was used as target for PCR pairs 

employing the Ricinl09-Eco oligonucleotide (Ricin-109Eco primer; 5- 
GGAGGAATCCGGAGATGAAACCGGGAGGAAA TACT ATT GTAAT-3 
(SEQ ID No. 141)) and a mutagenic primer for the 5' half of the linker as 
well as the Ricinl729PstI primer (Ricinl729-Pstl: 5- 

15 GTAGGCGCTGCAGATAACTTGCTGTCCTTTCAG-3 (SEQ ID No. 142)) 
and a mutagenic primer for the 3 1 half of the linker. The cycling conditions 
used for the PCRs were 98 degrees C for 2 min.; 98C 1 min., 52C 1 min., 
72C 1 min. 15 sec. (30 cycles); 72 degrees C lOmin.; 4 degrees C soak. The 
PCR products were then digested by EcoRI and PstI respectively, 

20 electrophoresed on an agarose gel, and the bands purified by via glass 
wool spin columns. Triple ligations comprising the PCR product pairs 
(corresponding halves of the new linker) and pVL1393 vector digested 
with EcoRI and PstI were carried out. Recombinant clones were identified 
by restriction digests of plasmid miniprep DNA and the altered linkers 

25 confirmed by DNA sequencing. See Figure 45 as an example of the cloning 
strategy. Recombinant clones were identified by restriction digests of 
plasmid miniprep DNA and the altered linkers confirmed by DNA 
sequencing. Note that since all altered linker variants were cloned directly 
into the pVL1393 vector odd-numbered pAPs were no longer required or 

30 produced. 

Isolation of Recombinant Baculoviruses 

Insect cells S. frugiperda (Sf9), and Trichoplusia ni (Tn368 and 
BTI-TN-581-4 (High Five)) were maintained on EX-CELL 405 medium 
(JRH Biosciences) supplemented with 10% total calf serum (Summers et al., 
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A Manual of Methods of Baculovirus Vectors and Insect Cell Culture 
Procedures, (Texas Agricultural Experiment Station, 1987)). Two 
micrograms of recombinant pVL1393 DNA was co-transfected with 0.5 
microgram of BaculoGold AcNPV DNA (Pharmingen) into 2 x 10 6 Tn368 
5 insect cells following the manufacturer's protocol (Gruenwald et aL, 
Baculovirus Expression Vector System: Procedures and Methods Manual, 
2nd Edition, (San Diego, CA, 1993)). On day 5 post-transfection, media 
were centrifuged and the supernatants tested in limiting dilution assays 
with Tn368 cells (Summers et aL, A Manual of Methods of Baculovirus 

10 Vectors and Insect Cell Culture Procedures, (Texas Agricultural 
Experiment Station, 1987)). Recombinant viruses in the supernatants were 
then amplified by infecting Tn368 cells at a multiplicity of infection (moi) of 
0.1, followed by collection of day 3 to 5 supernatants. A total of three 
rounds of amplification were performed for each recombinant following 

15 established procedures (Summers et al., A Manual of Methods of 
Baculovirus Vectors and Insect Cell Culture Procedures, (Texas 
Agricultural Experiment Station, 1987 and Gruenwald et aL, Baculovirus 
Expression Vector System: Procedures and Methods Manual, 2nd Edition, 
(San Diego, CA, 1993)). 

20 Expression of Mutant Proricin 

Recombinant baculoviruses were used to infect 1X10 7 Tn368 
or sf9 cells at an moi of 9 in EX-CELL 405 media (JRH Biosciences) with 
25mM a-lactose in spinner flasks. Media supernatants containing mutant 
proricins were collected 3 or 4 days post-infection. 

25 EXAMPLE 2 

Harvesting and affinity column purification of pro-ricin variants 

Protein samples were harvested three days post 
transfection. The cells were removed by centrifuging the media at 8288 g 
for ten minutesusing a GS3 (Sorvall) centrifuge rotor. The supernatant was 

30 further clarified by centrifuging at 25400 g using a SLA-1500 rotor (Sorvall) 
for 45 minutes. Protease inhibitor phenylmethylsulfonyl fluoride (Sigma) 
was slowly added to a final concentration of ImM. The samples were 
further prepared by adding lactose to a concentration of 20 mM (not 
including the previous lactose contained in the expression medium). The 
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samples were concentrated to 700 mL using a Prep/Scale-TFF Cartridge 
(2.5ft, 10K regenerated cellulose (Millipore)) and a Masterflex pump. The 
samples were then dialysed for 2 days in IX Column Buffer (50 mM Tris, 
100 mM NaCl, 0.02% NaN3, pH 7.5) using dialysis tubing (10 K MWCO, 32 
5 mm flat width(Spectra/Por)). Subsequently, the samples were clarified by 
centri fuging at 25400 g using a SLA-1500 rotor (Sorvall) for 45 minutes. 

Following centrifugation, the samples were degassed and 
applied at 4 degrees C to a XK26/20 (Pharmacia) column (attached to a 
Pharmacia peristaltic pump, Pharmacia Single-path Monitor UV-1 Control 

10 and Optical Units, and Bromma LKB 2210 2-Channel Recorder) containing 
20 mL of a-Lactose Agarose Resin (Sigma). The column was washed for 3 
hours with IX Column buffer. Elution of pro-ricin variant was performed 
by eluting with buffer (IX Column buffer (0.1% NaN3), 100 mM Lactose) 
until the baseline was again restored. The samples were concentrated 

15 using an Amicon 8050 concentrator (Amicon) with a YM10 76 mm 
membrane, utilizing argon gas to pressurize the chamber. The samples 
were further concentrated in Centricon 10 (Millipore) concentrators 
according to manufacturer's specifications. 

Purification of Variant pAP-Protein by gel filtration chromatography 

20 In order to purify the pro-ricin variant from processed 

material produced during fermentation, the protein was applied to a 
SUPERDEX 75 (16/60) column and SUPERDEX 200 (16/60) column 
(Pharmacia) connected in series equilibrated with 50 mM Tris, lOOmM 
NaCl, pH 7.5 containing 100 mM Lactose and 0.1% p-mercaptoethanol 

25 (pME). The flow rate of the column was 0.15 mL/min and fractions were 
collected every 25 minutes. The UV (280 nm) trace was used to determine 
the approximate location of the purified pAP-protein and thus determine 
the samples for Western analysis. 
Western analysis of column fractions 

30 Fractions eluted from the SUPERDEX columns (Pharmacia) 

were analyzed for purity using standard Western blotting techniques. An 
aliquot of 10[JiL from each fraction was boiled in IX sample buffer (62.6 
mM Tris-Cl, pH 6.8, 4.4% pME, 2% sodium dodecyl sulfate (SDS), 5% 
glycerol (all from Sigma) and 0.002% bromophenol blue (Biorad)) for five 
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minutes. Denatured samples were loaded on 12% Tris-Glycine Gels 
(Biorad) along with 50 ng of RCA 60 (Sigma) and 5 (xL of kaleidoscope 

prestained standards (Biorad). Electrophoresis was carried out for ninety 
minutes at 100V in 25 mM Tris-Cl, pH 8.3, 0.1% SDS, and 192 mM glycine 
5 using the BioRad Mini Protean II cells (Biorad). 

Following electrophoresis gels were equilibrated in transfer 
buffer (48 mM Tris, 39 mM glycine, 0.0375% SDS, and 20% Methanol) for a 
few minutes. PVDF Biorad membrane was presoaked for one minute in 
100% methanol, rinsed in ddH 2 0 and two minutes in transfer buffer. 

10 Whatman paper was soaked briefly in transfer buffer. Five pieces of 
Whatman paper, membrane, gel, and another five pieces of Whatman 
paper were arranged on the bottom cathode (anode) of the Pharmacia 
Novablot transfer apparatus (Pharmacia). Transfer was for one hour at 
constant current (2 mA/cm 2 ). 

15 Transfer was confirmed by checking for the appearance of 

the prestained standards on the membrane. Non-specific sites on the 
membrane were blocked by incubating the blot for thirty minutes in IX 
Phosphate Buffered Saline (IX PBS; 137 mM NaCl, 2.7 mM KC1, 8 mM 
Na 2 HP0 4 , 1.5 mM KH 2 P0 4 , pH 7.4) with 5% skim milk powder 

20 (Carnation). Primary antibody (Rabbit a-ricin, Sigma) was diluted 1:3000 
in IX PBS containing 0.1% Tween 20 (Sigma) and 2.5% skim milk and 
incubated with blot for forty five minutes on a orbital shaker (VWR). 
Non-specifically bound primary antibody was removed by washing the 
blot for ten minutes with IX PBS containing 0.2% Tween 20. This was 

25 repeated four times. Secondary antibody donkey anti-rabbit (Amersham) 
was incubated with the blot under the same conditions as the primary 
antibody. Excess secondary antibody was washed as described above. 
Blots were developed with the ECL Western Blotting detection reagents 
according to the manufacturer's instructions. Blots were exposed to 

30 Medtec's Full Speed Blue Film (Medtee) or Amersham 1 s ECL Hyperfilm 
(Amersham) for one second to five minutes. Film was developed in a 
KODAK Automatic Developer. 

Determination of lectin binding ability of pro-ricin variant 
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An Immulon 2 plate (VDVR) was coated with 100 \il per 
well of 10[ig/ml of asialofetuin and left overnight at 4°C. The plate was 
washed with 3X 300 (LiL per well with ddH 2 0 using an automated plate 
washer (BioRad). The plate was blocked for one hour at 37°C by adding 
5 300 (iL per well of PBS containing 1% ovalbumin. The plate was washed 
again as above. Pro-ricin variant pAP-protein was added to the plate in 
various dilutions in IX Baculo. A standard curve of RCA 60 (Sigma) from 1- 

10 ng was also included. The plate was incubated for 1 h at 37°C. The plate 
was washed as above. Anti-ricin monoclonal antibody (Sigma) was diluted 

10 1:3000 in IX PBS containing 0.5% ovalbumin and 0.1% tween-20, added at 
100 |iL per well and incubated for 1 h at 37°C The plate was washed as 
above. Donkey-anti rabbity polyclonal antibody was diluted 1:3000 in IX 
PBS containing 0.5% ovalbumin, 0.1% Tween-20, and added at 100|iL per 
well and incubated for 1 h at 37°C. The plate was given a final wash as 

15 described above. Substrate was added to plate at 100(xL per well (1 mg/ml 
o-phenylenediamine (Sigma), 1 pL/ml H 2 0 2 , 25 |LiL of stop solution (20% 
H2SO4) was added and the absorbance read (A490nm-A630nm) using a 
SPECTRA MAX 340 plate reader (Molecular Devices). 

Determination of pAP -Protein activity using the rabbit reticulocyte assay 
20 Ricin samples were prepared for reduction. 

A) RCA 60 = 3,500 ng/jxL of RCA 60 + 997 (xL IxEndo buffer 
(25mM Tris, 25mM KCl,5mM MGC1 2 , pH 7.6) 
Reduction = 95 jiL of lOng/juL + 5 jxL p-mercaptoethanol 

B) Ricin variants 

25 Reduction = 40 |xL variant + 2 |aL P-mercaptoethanol 

The ricin standard and the variants were incubated for 30 
minutes at room temperature. 
Ricin - Rabb it Reticuloc yte lysate reaction 

The required number of 0.5 mL tubes were labelled. (2 
30 tubes for each sample, + and - aniline). To each of the sample tubes 20 \iL 
of IX endo buffer was added, and 30 j^L of buffer was added to the 
controls. To the sample tubes either 10 |iL of 10ng/|xL Ricin or 10|xL of 
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variant was added. Finally, 30|iL of rabbit reticulocyte lysate was added to 
all the tubes. The samples were incubated for 30 minutes at 30°C using the 
thermal block. Samples were removed from the eppendorf tube and 
contents added into a 1.5 mL tube containing 1 mL of TRIZOL (Gibco). 
5 Samples were incubated for 15 minutes at room temperature. After the 
incubation, 200 jiiL of chloroform was added, and the sample was vortexed 
and spun at 12,000 g for 15 minutes at 4°C. The top aqueous layer from 
the samples was removed and contents added to a 1 mL tube containing 
500 jxL of isopropanol. Samples were incubated for 15 minutes at room 

10 temperature and then centrifuged at 12,000 for 15 minutes at 4°C. 
Supernatant was removed and the pellets were washed with 1 mL of 70% 
ethanol. Centrifugation at 12,000 g for 5 minutes at 4°C precipitated the 
RNA. All but approximately 20 |xL of the supernatant was removed and 
air dried. Pellets from the other samples (+aniline samples) were dissolved 

15 in 20 ilL of DEPC treated ddH 2 0. An 80 jjL aliquot of 1 M aniline (distilled) 
with 2.8 M acetic acid was added to these RNA samples and transferred to 
a fresh 0.5 mL tube. The samples were incubated in the dark for 3 minutes 
at 60°C RNA was precipitated by adding 100 iiL of 95% ethanol and 5^iL 
of 3M sodium acetate, pH 5.2 to each tube and centrifuging at 12,000 g for 

20 30 minutes at 4°C Pellets were washed with 1 mL 70% ethanol and 
centrifuged again at 12,000g for 5 minutes at 4°C to precipitate RNA. The 
supernatant was removed and air dried. These pellets were dissolved in 
IOjxL of 0.1 X E buffer. To all samples, 10 |iL of formamide loading dye 
was added. The RNA ladder (8 |i,L of ladder + 8 \iL of loading dye) was 

25 also included. Samples were incubated for 2 minutes at 70°C on the 
thermal block. Electrophoresis was carried out on the samples using 1.2% 
agarose, 50% formamide gels in 0.1X E buffer + 0.2% SDS. The gel was run 
for 90 minutes at 75 watts. RNA was visualized by staining the gel in 1 
jj,g/|xL ethidium bromide in running buffer for 45 minutes. The gel was 

30 examined on a 302 nm UV box, photographed using the gel 
documentation system and saved to a computer disk. 
Results: 

Protein Expression Yields 
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Aliquots were taken at each stop of the 
harvesting /purification and tested. Yields of functional ricin variant were 
determined by ELISA. Typical results of an 2400 mL prep of infected T. ni 
cells are given below. 



5 Aliquot ;qg pAF220 

Before concentration and dialysis 6000 

After concentration and dialysis 4931 

alpha- Lactose agarose column flow through 219 

alpha- Lactose agarose column elution 1058 
10 

Yield: 1058/6000 - 17.6% 



Purification of pAP -Protein and Western Analysis of column fractions 

Partially purfied pAP-protein was applied to Superdex 75 and 200 
(16/60) columns connected in series in order to remove the contaminating 

15 non-specifically processed pAP-protein. Eluted fractions were tested via 
Western analysis as described above and the fractions containing the most 
pure protein were pooled, concentrated and re-applied to the column. The 
variant was applied a total of three times to the column. Final purified 
pAP-protein has less than 1% processed variant. 

20 The purified pAP-protein was tested for susceptibility to cleavage 

by the particular protease and for activation of the A-chain of the pro-ricin 
variant, (inhibition of protein synthesis). Typically, pAP-protein was 
incubated with and without protease for a specified time period and then 
electrophoresed and blotted. Cleaved pAP will run as two 30 kDa proteins 

25 (B is slightly larger) under reducing (SDS-PAGE) conditions. Unprocessed 
pAP-protein, which contains the linker region, will run at 60 kDa. 
Activation of p AP -Protein variant with Specific Protease 

Activation of protease treated pAP-protein is based on the method 
of May et ah (EMBO Journal. 8 301-8, 1989). Activation of ricin A chain upon 

30 cleavage of the intermediary linker results in catalytic depurination of the 
adenosine 4325 residue of 28S or 26S rRNA. This depurination renders the 
molecule susceptible to amine-catalyzed hydrolysis by aniline of the 
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phosphodiester bond on either side of the modification site. The result is a 
diagnostic 390 base band. As such, reticulocyte ribosomes incubated with 
biochemically purified ricin A chain, released the characteristic RNA 
fragment upon aniline treatment of isolated rRNA (May, M.J. et al. Embo. 
5 Journal, 8:301-308 at 302-303 (1989)). It is on this basis that the assay allows 
for the determination of activity of a ricin A chain which has been cleaved 
from the intact unit containing a particular variant linker sequence. 
EXAMPLE 3 

In Vitro Protease Digestion of Proricin Variants: 

10 Affinity-purified proricin variant is treated with individual disease- 

specific proteases to confirm specific cleavage in the linker region. Ricin- 
like toxin variants are eluted from the lactose-agarose matrix in protease 
digestion buffer (50mM NaCl, 50mM Na-acetate, pH 5.5, ImM 
dithiothreitol) containing lOOmM lactose. Proricin substrate is then 

15 incubated at 37°C for 60 minutes with a disease-specific protease. The 
cleavage products consisting ricin A and B chains are identified using 
SDS/PAGE (Sambrook et al., Molecular Cloning: a Laboratory Manual, 
2nd. ed v Cold Spring Harbor Press, 1989), followed by Western blot 
analysis using anti-ricin antibodies (Sigma). 

20 Cathepsin B may be obtained from Medcor or Calbiochem. 

Matrix metalloproteinases may be prepared substantially as described by 
Lark, M.W. et al. (Proceedings of the 4th International Conference of the 
Imflammation Research Association Abstract 145 (1988)) and Welch, A.R. et al. 
(Arch. Biochem. Biophys. 324:59-64 (1995)). Candida acid protease may be 

25 prepared substantially as described in Remold, H.H. et al. (Biochim. Biophys. 
Acta 167:399-406 (1968)), Ray, T.L. and Payne, CD. (Infect Immunol 58:508- 
514 (1990)) and Fusek, M. et al. (FEBS Lett 327:108-112 (1993)). Hepatitis A 
protease may be prepared as described in Jewell, D.A. et al. (Biochemistry 
31:7862-7869 (1992)). Plasmodium proteases may be prepared as described 

30 in Goldberg, D.E. et al. (/. Exp. Med. 173:961-969 (1991)) and Cooper, J.A. 
and Bujard, H. (Mol Biochem. Parasitol. 56:151-160 (1992)). 
In Vitro Cytotoxicity Assay: 

Human ovarian cancer cells (e.g. MAI 48) are seeded in 96-well flat- 
bottom plates and are exposed to ricin-like toxin variants or control 
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medium at 37°C for 16 h. The viability of the cancer cells is determined by 
measuring [ 35 S]methionine incorporation and is significantly lower in wells 
treated with the toxin variants than those with control medium. 
In Vivo Tumour Growth Inhibition Assay: 
5 Human breast cancer (e.g. MCF-7) cells are maintained in suitable 

medium containing 10% fetal calf serum. The cells are grown, harvested 
and subsequently injected subcutaneously into ovariectomized athymic 
nude mice. Tumour size is determined at intervals by measuring two 
right-angle measurements using calipers. In animals that received ricin- 

10 like toxin variants containing the matrix metalloproteinase-sensitive 
linkers, tumour size and the rate of tumour growth are lower than animals 
in the control group. 
In Vivo Tumour Metastasis Assay: 

The metastasis study is performed substantially as described in 

15 Honn, K.V. et al. (Biochem. Pharmacol 34:235-241 (1985)). Viable B16a 
melanoma tumour cells are prepared and injected subcutaneously into the 
left axillary region of syngeneic mice. The extent of tumour metastasis is 
measured after 4 weeks. The lungs are removed from the animals and are 
fixed in Bouin's solution and macroscopic pulmonary metastases are 

20 counted using a dissecting microscope. In general without therapeutic 
intervention, injection of 10 5 viable tumour cells forms approximately 40- 
50 pulmonary metastases. The number of metastases in animal treated 
with proricin variants containing cathepsin B-sensitive linkers is 
substantially lower. 

25 EXAMPLE 4 

In Vitro Protease Digestion of Proricin Variants by Cancer Proteases 
Cathepsin B or MMP-9 

The general protocol for proricin digestion by cancer proteases is 
described in Examples 2 and 3. 
30 In Vitro Protease Digestion of Cathepsin B Proricin Variant 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. The 
proricin substrate is digested in a Cathepsin B protease buffer (50 mM 
Sodium acetate, 2 mM EDTA, 0.05% Triton) at 40°C. Two hours and 
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overnight (16 hr) digestion reactions are carried out using lOOng of 
proricin substrate and 100 and 618 ng of Cathepsin B protease per reaction 
(CALBIOCHEM, USA). The cleavage products of proricin (ricin A and B 
chains) are identified using SDS/PAGE (Sambrook et al v Molecular cloning: 
5 a laboratory Manual, 2nd. ed., Cold Spring Harbor Press, 1989), followed 
by Western blot analysis using anti-ricin antibodies (Sigma). 
In Vitro Protease Digestion of MMP-9 Proricin Variant 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. The 

10 proricin substrate is digested in IX column buffer (100 mM NaCl, 50 mM 
Tris, PH 7.5) at 37°C. Two hours and overnight (16 hr) digestion reactions 
are set up using 50 ng of MMP-9 proricin substrate and 20 and 200 ng of 
MMP-9 protease per reaction (CALBIOCHEM, USA). The cleavage 
products of proricin (ricin A and B chains) are identified using SDS/PAGE 

15 (Sambrook et al., Molecular cloning: a laboratory Manual, 2nd. ed., Cold 
Spring Harbor Press, 1989), followed by Western blot analysis using anti- 
ricin antibodies (Sigma). 

The protocol for Western analysis of ricin chains is described in 
Example 2. 

20 Results 

Figures 48 and 49 illustrate Western blots showing the cleavage of 
the protease-sensitive linkers by cathepsin B (pAP 214) and MMP-9 (pAP 
220) respectively. Without protease digestion, the proricin variant appears 
as a single band at approximately 60 kDa (Lane B of Figure 48 and Lane A 

25 of Figure 49). Wild type ricin A chain and B chain appear as two disparate 
bands at approximately 30 kDa (Lane A of Figure 48 and Lane E of Figure 
49). Increasing extent of proricin cleavage can clearly be observed with 
increasing protease concentration (Lanes C and D of Figure 48 and Lanes 
B-C of Figure 49). 

30 EXAMPLE 5 

In vitro protease digestion of various proricin variants by their 
corresponding proteases. 
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The general protocol for proricin digestion by coresponding 
proteases was as desribed in Examples 2 and 3 and should be considered in 
connection with the digestions described below. 

Cleavage of pAP-222 protein with the Matrix Metalloproteinase 2 
5 (MMP-2) 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. 

The pAP-222 protein sample (1.0 ug) was digested with the MMP-2 
protease (1.0 ug) overnight at 37° C. The total volume of the digestion 
10 reaction was 21.5 ul, and 0.250 ug of the reaction sample was loaded on a 
protein gel. The MMP-2 protease was purchased from 
S Calbiochem-Novabiochem Corporation, USA. 

Hi Cleavage of pAP-248 protein with the Human Cytomegalovirus (HCMV) 

H s protease 

15 Affinity-purified mutant proricin is treated with individual disease- 

specific proteases to confirm specific cleavage in the linker region. 
pi The pAP-248 protein sample (1.19 ug) was digested with the HCMV 

Hp protease (1.13 ug) overnight at 37°C. The total volume of the digestion was 

V 10.5 ul, and 0.279 ug of the reaction sample was loaded on a protein gel. 

O 20 The HCMV was purchased from BACHEM Bioscience Inc., USA. 

Cleavage of pAP-256 protein with the Hepatitis A virus 3C (HAV 3C) 
protease 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. 

25 The pAP-256 protein sample (1.26 ug) was digested with the HAV 

3C protease (5 ug) overnight at 37°C. The total volume of the digestion 
was 12.5 ul, and 0.302 ug of the digestion sample was loaded on a protein 
gel. The HAV 3C protease was a gift from Dr. G. Lawson from Bates 
Collage, Main, USA. 

30 Cleavage of pAP-270 protein with the Matrix Metalloproteinase 2 
(MMP-2) 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. 
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The pAP-270 protein sample (0.120 ug) was digested with the MMP-2 
protease (0.25 ug) overnight at 37° C. The total volume of the digestion 
reaction was 22.5 ul, and 0.106 ug of the reaction sample was loaded on a 
protein gel. The MMP-2 protease was purchased from 
5 Calbiochem-Novabiochem Corporation, USA. 

Cleavage of pAP-288 protein with tPA plasminogen tissue activator 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. The 
pAP-288 protein sample (1.65 ug) was digested with the t-PA protease (0.5 

10 ug) overnight at 37° C. The total volume of the digestion reaction was 55 
ul, and 0.6 ug of the reaction sample was loaded on a protein gel. The t-PA 
was purchased from Sigma Chemical Co., USA. 
Cleavage of pAP-294 protein with human neutraphil elastase 

Affinity-purified mutant proricin is treated with individual disease- 

15 specific proteases to confirm specific cleavage in the linker region. 

The pAP-256 protein sample (0.6 ug) was digested with the Elastase 
protease (5 ug) at 25° C for one hour. The total volume of the digestion 
reaction was 52.5 ul, and 0.171 ug of the digestion sample was loaded on a 
protein geL The Human Neutrophil Elastase protease was purchased from 

20 Cedarlane Laboratories Limited, Canada. 
Cleavage of pAP-296 protein with calpain 

Affinity-purified mutant proricin is treated with individual disease- 
specific proteases to confirm specific cleavage in the linker region. The 
pAP-296 protein sample (2.05 ug) was digested with the Calpain protease 

25 (10 ug) overnight at 37° C. The total volume of the digestion reaction was 
35 ul and 0.761 ug of the reaction sample was loaded on a protein gel. The 
Calpain protease was purchased from Sigma Chemical Co., USA 
Results 

Figures 52, 54, 58 & 66(MMP-2), 60, 64 and 62 show the cleavage of 
30 proteases of linkers by HCMV, HAV 3C, MMP-2, t-PA, calpain, and human 
neutraphil elastase respectively. Without protease digestion, the proricin 
variants appear as a single band at approximately 60kDA (Lane A in 
connection with Figure 52; Lane B of Figure 54; Lane A of Figure 58; Lane B 
of Figure 60; and Lane C of Figure 62; lane B of Figure 64 and lane B of 
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Figure 66). Wild type ricin chain A and B appear as two bands at 
approximately 30kDA (see for example Lanes C and D of Figure 52) 
proricin cleavage can clearly be obvserved with the appearance of 30kDA 
bands in connection with the protein which has been digested by the 
5 respective protease (see Lane B of Figure 52; Lane C of Figure 54; or Lane 
B of Figure 58 for examples). 
EXAMPLE 6 

In Vitro Translation Assay (Activation by Cancer Proteases Cathepsin B 
or MMP-9 

10 The general protocol for the rabbit retoculocyte lysate reaction to 

test the cytotoxicity of cancer protease-activiated proricin is described 
briefly in Example 3 and is described in more detail in Example 2. 
Results 

Activation of pAP 214 and pAP 220 proricin variants by cathepsin B 
15 and MMP-9, based on the method of May et al. (EMBO J. 8:301-308, 1989), 
is illustrated in Figures 50 and 51 respectively. The appearance of the 390 
base pair product (positive control) is observed in Lane F of Figure 50 and 
Lane G of Figure 51. This 390 base pair product is absent in the negative 
control lanes. Without cathepsin or MMP-9 activation, no or minimal N- 
20 glycosidase activity in the pAP 214 variant (Lanes H to L, Figure 50) or the 
pAP 220 variant (Lanes A to E, Figure 51) was observed. When the pAP 
214 variant and the pAP 220 variant were activated by cathepsin or MMP-9 
respectively, appearance of the 390 base pair product was observed in a 
proricin concentration-dependent manner (Lanes A to E of Figure 50 and 
25 Lanes H to L of Figure 51). The present experimental series demonstrated 
the successful and selective activation of proricin variants by cancer- 
associated proteases. 
EXAMPLE 7 

The general protocol for the rabbit retoculocyte lysate reaction is 
30 described briefly in Example 3 and is described in more detail in Example 2, 
all of which compliments the description below. 

Depurination of Rabbit Reticulocyte 28S Ribosomal RNA by Digested 
and Undigested Ricin Variants 
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Affinity-purified mutant proricin mutants which were previously 
digested with the disease-specific protease, were reduced with 5% 
2-mercaptoethanol then diluted to lOOng, 14.2ng,2.0ng,291pg, and 41.7pg 
with 1 X ENDO buffer(25mM Tris pH 7,6, 25mM KC1, 5mM MgCl 2 ) and 

5 incubated with rabbit reticulocyte lysate, untreated (Promega) for 
30minutes at 30(C. To compare the digested with the undigested proricin 
variant, the proricin in digestion buffer (according to the specific digestion 
protocol) was treated in the same manner as the digested sample. As a 
positive and negative control, lOng of ricin A chain and 1 X ENDO buffer 
10 consecutively, was incubated with rabbit reticulocyte lysate, untreated, for 
30 min at 30°C 

Aniline Cleavage of rRNA and Gel Fractionation 

Total RNA was then extracted from reticulocyte lysate translation 
mixtures with Trizol reagent (Gibco-BRL) as per manufacturer's 
15 instructions. The RNA was incubated with 80ul of 1M aniline (distilled) 
with 2.8M acetic acid for 3 min at 60(C in the dark. Ethanol-precipitated 
RNA samples were dissolved in 20ul of 50% formamide, 0.1X E buffer 
(3.6mM Tris, 3mM NaH 2 P0 4 , 0.2mM EDTA), and 0.05% xylene cyanol. 

lOul of this was heated to 70 (C for 2 minutes, loaded and electrophoresed 
20 in 1.2% agarose, 0.1X E buffer, and 50% formamide gel with RNA running 
buffer (0.1 X E buffer, 0.2% SDS). 
Results 

Activation of pAP-248 proricin variant by HCMV; pAP-256 by 
HAV3C protease; pAP-270 by MMP-2 protease; pAP-288 by t-PA protease; 

25 pAP-294 by human neutrophil elastase; pAP-296 by calpain; and pAP-222 
by MMP-2 is illustrated in Figures 52, 55, 59, 61, 63, 65, and 67 respectively. 
The appearance of the 390 base pair product (deposit of control) is 
obverved in lane L of Figures 53, 55, 61, 63, 65 and 67. The 390 base pair 
product is observed in lane A of Figures 59 (activation of pAP-270 by 

30 MMP-2). This 390 base pair product is absent in the negative control lanes. 
Without the specific protease activation, no or minimal activity is seen in 
the lanes which contained only the proricin variant without digestion (see 
lane A, B, C, D, and E of Figures 53, 55, 61, 63, 65, and 67). The same 
observation is made in connection with p AP-270 in Figure 59, however, the 
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undigested lanes appear as H, I, J, K and L. When the variant was 
activated by its respective protease, there is an appearance of the 390 base 
pair product in a proricin concentration-dependent manner (see Lanes H, I, 
J, K and L of Figure 53, 55, 61, 63, 65, and 67 and Lanes A, B, C, D, and E of 
5 Figure 59), The present experimental series demonstrate the successful 
and selective activation of the identified proricin variants by selective 
corresponding proteases. 
EXAMPLE 8 

Procedure for Examin in g the Cytotoxicity of Ricin and Ricin Variants on 
10 the COS-1 Cell Line 
Cell Preparation 

After washing with 1XPBS (0.137 M NaCl, 2.68 mM KC1, 810 mM 
Na 2 HP0 4 , 1.47 mM KH 2 P0 4 ), cells in log phase growth were removed 
from plates with IX trypsin/EDTA (Gibco/BRL). The cells were 

15 centrifuged at 1100 rpm for 3 min, resuspended in Dulbecco's Modified 
Eagle Medium containing 10%FBS and IX pen/ strep, and then counted 
using a haemocytometer. They were adjusted to a concentration of 5 X 10 4 
cells •ml -1 . One hundred microliters per well of cells was added to wells 2B - 
2G through to wells 9B - 9G of a Falcon 96 well tissue culture plate. A 

20 separate 96 well tissue culture plate was used for each sample of Ricin or 
Ricin variant. The plates were incubated at 37(C with 5% C0 2 for 24 hours* 
Toxin Preparation 

The Ricin and Ricin variants were sterile filtered using a 0.22jxm 
filter (Millipore). The concentration of the sterile samples were then 

25 quantified by A 2 go an d confirmed by BCA measurements (Pierce). For the 

variants digested with the protease in vitro, the digests were carried out as 
described in the digestion procedure for each protease. The digests were 
then diluted in the 1000 ng^mH dilution and sterile filtered. The Ricin and 
the undigested pAP214 in the pAP 214 cytotoxicity data were treated in the 
30 same manner but without the Cathepsin B treatment. Ricin and Ricin 
variants were serially diluted to the following concentrations: 1000 
ng^mH, 100 ng^mT 1 , 10 ng«ml 4 , 1 ng^mH, 0.1 ng*mH, 0.01 ng*mH, 
0.001 ng^ml -1 with media containing 10%FBS and IX pen/strep. 
Application of Toxin or Variants to Plates 
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Columns 2 to 9 were labeled: control, 1000 ng # ml _1 , 100 ng^ml -1 , 
10 ng^mH, 1 ng»mH, 0.1 ng^rnH, 0.01 ng«mH, 0.001 ng*mH 
consecutively. The media was removed from all the sample wells with a 
multichannel pipettor. For each plate of variant and toxin, 50jil of media 
5 was added to wells 2B to 2G as the control, and 50 jxl of each sample 
dilution was added to the corresponding columns. For the pAP220 + 
MMP-9 data, the plates were incubated for one hour at 37(C with 5% C0 2 , 
then washed once and replaced with media, then incubated for 48 hours at 
37(C with 5% C0 2 . For the pAP 214 + Cathepsin B data, the toxin was left 
10 on the plates and incubated for 24 hours at 37(C with 5% C0 2/ then 50 |xl of 

media was added to the wells with the toxin and incubated for another 24 
hours at 37(C with 5% C0 2 . 

Sample Application 

The whole amount of media (and /or toxin) was removed from 
15 each well with a multichannel pipettor, and replaced with 100 \il of the 
substrate mixture (Promega Cell Titer 96 Aqueous Non-Radioactive Cell 
Proliferation Assay Kit). The plates were incubated at 37(C with 5% C0 2 

for 2 to 4 hours, and subsequently read with a Spectramax 340 96 well plate 
reader at 490nm. The IC 50 values were calculated using the GRAFIT 

20 software program. 
Results 

In experiments with pAP-214 and Cathepsin B incubated with COS- 
1 cells, it may be seen that cells incubated with pAP-214 alone, pAP-214 was 
ineffective at causing cell death (see Figure 56). However, the cytotoxicity 

25 of pAP-214 digested with Cathepsin B behaves similarly to the ricin 
control in COS-1 cells. This is also illustrated in Figure 56. Similarily, the 
cytotoxicity of undigested pAP-220 when incubated with COS-1 cells is 
lower than the cytotoxicity observed with COS-1 cells incubated with pAP- 
220 digested with MMP-9. Indeed the results suggest that the toxicity of 

30 digested pAP-220 is greater than that of ricin. (See Figure 57). 
EXAMPLE 9 

Procedure for Examining the Cytotoxicity of Ricin and Ricin Variants on 
Various Tissue Culture Cell Lines 



-75- 

Cell Preparation 

After washing with 1XPBS (1.37M NaCl, 26.8mM KC1, 81mM 
Na 2 HP0 4/ 14.7mM KH 2 P0 4 ), cells in log phase growth were removed 
from plates with IX trypsin/EDTA (Gibco/BRL). The cells were 
5 centrifuged at 1100 rpm for 3 min, resuspended in media containing 
10%FBS and IX pen/ strep (media used depended on the cell line being 
tested), and then counted using a haemocytometer. They were adjusted to 
a concentration of 5 X 10 4 cells •ml -1 (faster growing cell lines were 
adjusted to 2 X10 4 ceils •mH). One hundred microliters per well of cells 
10 was added to wells 2B - 2G through to wells 9B - 9G of a Falcon 96 well 
tissue culture plate. A separate 96 well tissue culture plate was used for 
each sample of Ricin or Ricin variant. The plates were incubated at 37(C 
with 5% C0 2 for 24 hours. 
Toxin Preparation 

15 The Ricin and Ricin variants were sterile filtered using a 0.22|im 

filter (Millipore). The concentration of the sterile samples were then 
quantified by A 280 an d confirmed by a BCA measurement (Pierce). Ricin 

and Ricin variants were serially diluted to the following concentrations: 
3000 ng^mH, 300 ng^mH, 30 ng^mH, 3 ng*mH, 0.3 ng*mH, 
20 0.03ng*mH, 0.003 ng»mH with media containing 10%FBS and IX 
pen/ strep. 

Application of Toxin or Variants to Plates 

Columns 2 to 9 were labeled: control, 0.001 ng*mH, 0.01 ng^mH, 
0.1 ng^ml" 1 , lng*mH, 10 ngTnH, 100 ng*mH, 1000 ng^mH 

25 consecutively. For each plate of variant and toxin, 50|il of media was 
added to wells 2B to 2G as the control, and 50jul1 of each sample dilution 
was added to the corresponding columns containing 100 pi per well of cells 
(i.e. 50 \i\ of the 3000 ng*mH dilution added to the wells B-G in column 9, 
labeled 1000 ng»ml _1 ). The plates were incubated for 48 hours at 37(C 

30 with 5% C0 2 . 

Sample Application 

An amount of 140 (il was removed from each well with a 
multichannel pipettor, and replaced with 100 ^1 of the substrate mixture 
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(Promega Cell Titer 96 Aqueous Non-Radioactive Cell Proliferation Assay 
Kit). The plates were incubated at 37(C with 5% C0 2 for 2 to 4 hours, and 

subsequently read with a Spectramax 340 96 well plate reader at 490nm. 
The IC 50 values were calculated using the GRAFIT software program. 

5 Results 

Referring to Table 2, it may be seen that the survival of cells is 
correlated with the proricin variant and the cell specific protease produced 
by the cell type. For example, in the HT1080 cell line, both pAP-214 and 
pAP-220 required only 2-1/2 times the amount of ricin to achieve the same 

10 level of cytotoxicity. On the other hand, pAP-224 required 193 times the 
amount of ricin to achieve the same level of cell death. As well, it may be 
seen that in the cells where expression of Cathepsin D is found, pAP-214 
and 220 were more effective at causing cell death than ricin and more 
effective than pAP-224. Details concerning the various cells types used in 

15 these experiments are outlined below. 

COS-1 (African Green Monkey Kidney Cells) 

This is an SV40 transformed cell line which was prepared from established 
simian cells CV-1. (Reference: Gluzman, Y. (1975) Cell, 23, 175 - 
182)(ATCC CRL 1650) 
20 HT-1080 Human Fibrosarcoma 

(ATCC CCL 121) This cell line was shown to produce active MMP-9 in 
tissue culture. References: Moore et al. (1997) Gynecologic Oncology 65, 
83-88. 

9L Rat Glioblastoma 

25 Glioblastomas are generally associated with cathepsin B expression. Levels 
of cathepsin B expression correspond to the extent of progression of 
malignancy i.e. highest levels for glioblastomas over anaplastic 
astrocytomas over low-grade gliomas and normal brain tissue. The 9L cell 
line was provided by Dr. William Jia of the B.C. Cancer Agency. 

30 References: Mikkelsen et al. (Aug. 1995) Journal of Neurosurgery 83(2), 
285-290. Nakano et al. (1995) J. of Neurosurgery 83(2), 298-307. 
MCF-7 Human Breast Cancer Cell Line (Epithilial) 
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(ATCC CRL 1555) In the absence of estrogen cathepsin B has not been 
shown to be elevated relative to normal cells. It can be induced with 
estrogen to produce Cathepsin D. Production of MMP-9 is unknown. 

Having illustrated and described the principles of the invention in a 
5 preferred embodiment, it should be appreciated to those skilled in the art 
that the invention can be modified in arrangement and detail without 
departure from such principles. We claim all modifications coming within 
the scope of the following claims. 

All publications, patents and patent applications referred to herein 
10 are incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
individually indicated to be incorporated by reference in its entirety. 
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FULL CITATIONS FOR CERTAIN REFERENCES REFERRED TO IN 
THE SPECIFICATION 

Bever Jr., C.T., Panitch, H.S., and Johnson, K.P. (1994) Neurology 44(4), 
745-8. Increased cathepsin B activity in peripheral blood mononuclear cells 
5 of multiple sclerosis patients. 

Cohen, P., Graves, H.C., Peehl, D.M., Kamarei, M., Giudice, L.C., and 
Rosenfeld, R.G. (1992) Journal of Clinal Endocrinology and Metabolism 
75(4), 1046-53. Prostate-specific antigen (PSA) is an insulin-like growth 
factor binding protein-3 protease found in seminal plasma. 

10 Conover, C.A. and De Leon, D.D. (1994) J. Biol. Chem. 269(10), 7076-80. 
Acid activated insulin-like growth factor-binding protein-3 proteolysis in 
normal and transformed cells. Role of cathepsin D. 

Hansen, G., Schuster, A., Zubrod, C, and Wahn, V. (1995) Respiration 
62(3), 117-24. Alpha 1-proteinase inhibitor abrogates proteolytic and 
15 secretagogue activity of cystic fibrosis sputum. 

Muller, H.L., Oh, Y., Gargosky, S.E., Lehrnbecher, T., Hintz, R.L., and 
Rosenfeld, R.G. (1993) Journal of Clinical Endocrinology and Metabolism 
77(5), 1113-9. Concentrations of insulin-like growth factor (IGF)-binding 
protein-3 (IGFBP-3), IGF, and IGFBP-3 protease activity in cerebrospinal 
20 fluid of children with leukemia, central nervous system tumor, or 
meningitis. 
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TABLE 1 

Table I - Sequence and Location of Oligonucleotide Primers 



Name of 
Primer 


Primer Sequence t 


Corresponds to 
preproricin 
nucleotide 
numbers: (see 
Figures 8-10) 


Ricin-109 


; 5 ' - GGAGATGAAACCGGGAGGAAATACTATTGTAAT-3 ' 


27 to 59 


Ricin-99Eco 


5'- GCGGAATTCCGGGAGGAAATACTATTGTAAT -3 • 


37 to 59 


Ricin267 


5'- ACGGTTTATTTTAGTTGA-3' 

■f 


300 to 317 


Ricin486 


5'- ACTTGCTGGTAATCTGAG -3' 


519 to 536 


Ricin725 


5'- AGAATAGTTGGGGGAGAC -3' 


758 to 775 


Ricin937 


5 ' - AATGCTGATGTTTGTATG -3' 


970 to 987 


RicinllSl 


5'- CGGGAGTCTATGTGATGA -3' 


1184 to 1201 


Ricinl399 


5 ' -GCAAATAGTGGACAAGTA -3' 


1432 to 1449 


Ricin 1627 


5 ' - GGATTGGTGTTAGATGTG -3 ' 


1660 to 1677 


Ricin 1729C 


5'- ATAACTTGCTGTCCTTTCA -3* 


1864 to 1846 


Ricin 1729C 
Xba 


5'- CGCTCT AGAT AA ^y^T^W~*IY5TY^^ l TT^ x r , 


1864 to 1846 



underlined sequences inserted for subcloning purposes and not included in final preproricin 

sequences 
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Table 2: Comparative Toxicities to Selected Cell Lines of Ricin and 
Ricin Provariants 



Cell Line 

COS-1 

HT1080 

9L 

MCF-7 

(without estrogen) 



IC50Ri c i n 

(ng/ml) 

0.1 
0.5 
10.8 
0.09 



IC50 pA P214 
IC50 Ridn 


ICSOpAPMO 
ICSOiudn 


IC50, 
IC50] 


17 


22 


150 


2.46 


2.14 


193 


1.3 


1.7 


32.3 


27.8 


40 


742 



pAP224 
Ricin 
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I CLAIM: 

1. A purified and isolated nucleic acid having a nucleotide sequence 
encoding an A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a 
heterologous linker amino acid sequence linking the A and B chains, the 

5 heterologous linker sequence containing a cleavage recognition site for a 
protease localized in cells or tissues affected by a specific disease. 

2. The nucleic acid sequence of claim 1 wherein the linker sequence 
encodes a peptide wherein at least 5 amino acids differ from the wild-type. 

3. The nucleic acid sequence of claim 2 wherein the linker sequence 
10 contains a cleavage recognition site recognized by a protease selected from 

the group consisting of: a cancer associated protease, a viral protease, a 
fungal protease, and a parasite protease. 

4. A nucleic acid sequence of claim 3 wherein the A chain is ricin A 
chain, abrin toxin A chain, diphtheria toxin A chain, or Domain I of 

15 Pseudomonas exotoxin. 

5. A nucleic acid molecule of claim 3 wherein the A chain is volkensin 
toxin A chain, cholera toxin A chain, modeccin toxin A chain, viscumin 
toxin A chain or shiga toxin A chain. 

6. A nucleic acid sequence of claim 3 wherein the B chain is ricin B 
20 chain, abrin toxin A chain, diphtheria toxin B chain, or Domain II/III of 

Pseudomonas exotoxin. 

7. A nucleic acid molecule of claim 3 wherein the B chain is volkensin 
toxin B chain, cholera toxin B chain, modeccin toxin B chain, viscumin toxin 
B chain or shiga toxin B chain. 



25 



8. A nucleic acid sequence of claim 3 wherein the cleavage recognition 
site is recognized by a cancer-associated protease which is selected from 
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the group consisting of: cathepsin B, a matrix metalloproteinase, cathespin 
L, cathespin D, urokinase-type plasminogen activator, tissue-type 
plasminogen activator, human prostate-specific antigen, kallikrein, 
neutrophil elastase, and calpain. 

5 9. A nucleic acid sequence of claim 3 wherein the cleavage recognition 
site is recognized by a parasitic protease which is a Plasmodium 
falciparum protease. 

10. A nucleic acid sequence of claim 3 wherein the cleavage recognition 
site is recognized by viral protease which is selected from the group 

10 consisting of: human cytomegalovirus, human herpes virus, varicella 
zoster virus, hepatitis A virus, hepatitis C virus, Epstein-Barr virus-specific 
protease, and infectious laryngotracheitis virus. 

11. A nucleic acid sequence of claim 3 wherein the cleavage recognition 
site is recognized by fungal protease which is a Candida acid protease. 

15 12. A nucleic acid sequence of claim 3 having the nucleotide sequence 
according to SEQ ID No. 3; SEQ ID No 5; SEQ ID No 7; SEQ ID No 9; SEQ 
ID No 11; SEQ ID No 13; SEQ ID No 15; SEQ ID No 17; SEQ ID No 19; 
SEQ ID No 21; SEQ ID No 23; SEQ ID No 25; SEQ ID No 27; SEQ ID No 29; 
SEQ ID No 31; SEQ ID No 33; SEQ ID No 35; SEQ ID No 37; SEQ ID No 39; 

20 SEQ ID No 48; SEQ ID No 50; SEQ ID No 52; SEQ ID No 54; SEQ ID No 74; 
SEQ ID No 77; SEQ ID No 80; SEQ ID No 83; SEQ ID No 86; SEQ ID No 89; 
SEQ ID No 92; SEQ ID No 95; SEQ ID No 98; SEQ ID No 101; SEQ ID No 
104; SEQ ID No 107; SEQ ID No 110; SEQ ID No 113; SEQ ID No 116; 
SEQ ID No 119; SEQ ID No 122; or SEQ ID No 125. 

25 13. A plasmid incorporating the nucleic acid of claim 12. 

14. A baculovirus transfer vector incorporating the nucleic acid of 
claim 12. 
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15. A recombinant protein comprising an A chain of a ricin-like toxin, a 
B chain of a ricin- like toxin and a heterologous linker amino acid sequence, 
linking the A and B chains, wherein the linker sequence contains a cleavage 
recognition site for a protease localized in cells or tissues affected by a 

5 specific disease. 

16. The recombinant protein of claim 15 wherein at least 5 amino acids 
of the linker amino acid sequence differ from the wild-type. 

17. The recombinant protein of claim 16 wherein the linker sequence 
contains a cleavage recognition site which is recognized by a protease 

10 selected from the group consisting of: a cancer, viral, fungal, and a 
parasitic protease. 

18. A recombinant protein of claim 17 wherein the A chain is ricin A 
chain, abrin toxin B chain, diphtheria toxin A chain, or Domain I of 
Pseudomonas exotoxin. 

15 19. A recombinant protein of claim 17 wherein the A chain is volkensin 
toxin A chain, cholera toxin A chain, modeccin toxin A chain, viscumin 
toxin A chain, or shiga toxin A chain. 

20. A recombinant protein of claim 17 wherein the B chain is ricin B 
chain, abrin toxin B chain, diphtheria toxin B chain, or Domain II/III of 

20 Pseudomonas exotoxin. 

21. A recombinant protein of claim 17 wherein the B chain is volkensin 
toxin B chain, cholera toxin B chain, modeccin toxin B chain, viscumin toxin 
B chain, or shiga toxin B chain. 

22. A recombinant protein of claim 17 wherein the cleavage 
25 recognition site is recognized by a cancer-associated protease selected from 

the group consisting of: cathepsin B, a matrix metalloproteinase, cathespin 
L, cathespin D, urokinase-type plasminogen activator, tissue-type 
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plasminogen activator, human prostate-specific antigen, kallikrein, 
neutrophil elastase, and calpain. 

23. A recombinant protein of claim 17 wherein the cleavage 
recognition site is recognized by a parasitic protease which is a 

5 Plasmodium falciparum protease. 

24. A recombinant protein of claim 17 wherein the cleavage 
recognition site is recognized by a viral protease which is selected from the 
group consisting of: human cytomegalovirus, human herpes virus, 
varicella zoster virus, hepatitis A virus, hepatitis C virus, Epstein-Barr 

10 virus-specific protease, and infectious laryngotracheitis virus. 

25. A recombinant protein of claim 17 wherein the cleavage 
recognition site is recognized by a fungal protease which is a Candida acid 
protease. 

26. A recombinant protein of claim 17 having the linker amino acid 
15 sequence according to SEQ ID No. 40; SEQ ID No. 41; SEQ ID No. 42; SEQ 

ID No. 43; SEQ ID No. 44; SEQ ID No. 45; SEQ ID No. 46; SEQ ID No. 55; 
SEQ ID No. 56; SEQ ID No. 57; SEQ ID No. 58; SEQ ID No. 59; SEQ ID No. 
60; SEQ ID No. 61; SEQ ID No. 62; SEQ ID No. 63; SEQ ID No. 64; SEQ ID 
No. 65; SEQ ID No. 66; SEQ ID No. 67; SEQ ID No. 68; SEQ ID No. 69; SEQ 
20 ID No. 70; SEQ ID No. 71; SEQ ID No. 72; SEQ ID No. 75; SEQ ID No. 78; 
SEQ ID No. 81; SEQ ID No. 84; SEQ ID No. 87; SEQ ID No. 90; SEQ ID No. 
93; SEQ ID No. 96; SEQ ID No. 99; SEQ ID No. 102; SEQ ID No. 105; SEQ ID 
No. 108; SEQ ID No. Ill; SEQ ID No. 114; SEQ ID No. 117; SEQ ID No. 120; 
SEQ ID No. 123; or SEQ ID No. 126. 

25 27. A method of inhibiting or destroying cells of a tissue affected by a 
disease, wherein a protease specific to the disease is localized in the cells or 
tissue comprising the steps of: 

(a) preparing a purified and isolated nucleic acid having a 
nucleotide sequence encoding an A chain of a ricin-like toxin, a B chain of a 
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ricin-like toxin, and a heterologous linker amino acid sequence, linking the 
A and B chains, wherein the linker sequence contains a cleavage 
recognition site for the protease; 

(b) introducing the nucleic acid into a host cell and expressing the 
5 nucleic acid in the host cell to obtain a recombinant protein comprising an 

A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a linker amino 
acid sequence; 

(c) suspending the protein in a pharmaceutically acceptable carrier, 
diluent or excipient, and 

10 (d) contacting the cells with the recombinant protein. 

28. A method according to claim 27 wherein the linker sequence 
encodes a peptide wherein at least 5 amino acids differ from the wild-type. 

29. The method of claim 28 where the disease is one of cancer, a fungal 
infection, or cells infected with a virus or parasite. 

15 30. A method of inhibiting or destroying cells of a tissue affected by a 
disease, wherein a protease specific to the disease is localized in the cells or 
tissue comprising the step of contacting the cells with a recombinant 
protein according to claim 26. 

31. A method of treating a disease comprising administering a 
20 recombinant protein according to claim 26 to an animal in need thereof. 

32. A method of treating a disease comprising administering a nucleic 
acid molecule according to claim 12 to an animal in need thereof. 

33. A method of treating a mammal with cancer or infected with a 
fungus, virus or parasite, comprising the steps of preparing a recombinant 

25 protein of claim 15 wherein the linker sequence contains a cleavage 
recognition site for a cancer, fungal, viral or parasitic protease and 
administering the protein to the mammal. 
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34. A method of inhibiting or destroying cells affected by a disease 
associated with a protease, comprising contacting cells affected by a disease 
associated with the protease with a recombinant ricin protein such that 
cells affected by a disease associated with a protease are inhibited or 

5 destroyed, wherein said recombinant ricin protein has an A chain of a ricin- 
like toxin, B-chain of a ricin-like toxin, and a heterologous linker amino acid 
sequence containing a cleavage recognition site for the protease associated 
with the disease. 

35. A method according to claim 33, wherein said heterologous linker 
10 amino acid sequence contains at least five amino acids which differ from 

the wild-type. 

36. The method of claim 34 where the disease is one of cancer, a fungal 
infection, or cells infected with a virus or parasite. 

37. A process for preparing a pharmaceutical for treating a mammal 
15 with cancer, fungal infection, viral infection or parasitic infection, 

comprising the steps of : 

(a) preparing a purified and isolated nucleic acid having a 
nucleotide sequence encoding an A chain of a ricin-like toxin, a B chain of a 
ricin-like toxin, and a heterologous linker amino acid sequence, linking the 

20 A and B chains, wherein the linker sequence contains a cleavage 
recognition site for a cancer, viral or parasitic protease; 

(b) introducing the nucleic acid into a host cell and expressing the 
nucleic acid in the host cell to obtain a recombinant protein comprising an 
A chain of a ricin-like toxin, a B chain of a ricin-like toxin and a linker amino 

25 acid sequence; 

(c) suspending the protein in a pharmaceutically acceptable carrier, 
diluent or excipient. 



38. A pharmaceutical composition for treating cancer or a fungal, or 
viral, or parasitic infection in an animal comprising the recombinant 
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protein of claim 16 and a pharmaceutical^ acceptable carrier, diluent or 
excipient. 

39. A pharmaceutical composition for treating cancer or a fungal, or 
viral, or parasitic infection in an animal comprising the nucleic acid 
molecule of claim 3 and a pharmaceutical^ acceptable carrier, diluent or 
excipient. 
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ABSTRACT OF THE DISCLOSURE 

The present invention provides a protein having an A chain of a 
ricin-like toxin, a B chain of a ricin-like toxin and a heterologous linker 
amino acid sequence, linking the A and B chains. The linker sequence 

5 contains a cleavage recognition site for a disease specific protease such as a 
cancer, fungal, viral or parasitic protease. The invention also relates to a 
nucleic acid molecule encoding the protein and to expression vectors 
incorporating the nucleic acid molecule. Also provided is a method of 
inhibiting or destroying mammalian cancer cells, cells infected with a virus, 

10 a fugus, or parasite, or parasites utilizing the nucleic acid molecules and 
proteins of the invention and pharmaceutical compositions for treating 
human cancer, viral infection, fugal infection, or parasitic infection. 
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Complete Sequence of Baculovirus 
Transfer Vector, pVL1393 

ID PVL1393 preliminary; circular DNA; SYN; 

9632 BP. 

XX 

AC IG1137; 
XX 

DT 01-FEB-1993 (Rel. 7, Created) 

DT 01-JUL-1995 (Rel. 12, Last updated, Version 

1) 

XX 

DE E. coli plasmid vector pVL1393 - complete. 
XX 

KW cloning vector. 
XX 

OS Cloning vector 

OC Artificial sequences; Cloning vehicles. 
XX 

RN [1] 

RC p2Bac from baculovirus 

RC p2Blue from p2Bac 

RC pBlueBac from AcNPV 

RC pBlueBac2 from AcNPV 

RC pBlueBacIII from AcNPV 

RC pBlueBacHisA from AcNPV 

RC pBlueBacHisB from AcNPV 

RC pBlueBacHisC from AcNPV 

RC pVL1392, pVLl393 from pAc3 60 

RA 

RT 

RL The Digest 5:2-2(1992). 
XX 

CC NM (pVL1393) 

CC CM (yes) 

CC NA (ds-DNA) 

CC TP (circular) 

CC ST () 

CC TY (plasmid) 

CC SP (British 

Biotechnology) (Invitrogen) 

CC HO (E.coli NM522) (E.coli 

INValphaF' ) (insect) 

CC CP () 

CC FN (expression) (transfer) 

CC SE () 

CC PA <pAC3 60) 

CC BR (pVL13 92) 

CC OF () 

CC OR () 

XX 

FH Key Location/Qualifiers 
FH 
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/note="l. pAc360, ori/amp/AcMNPV 

-> P VL1393 9632bp" 
0. .0 

/note="TRN AcMNPV" 
868. .868 

/notes" SIT SacII" 
1395 . .1395 
/note= n SIT Apal" 
1901. .1901 
/note="SIT Xhol" 
0. .0 

/note="PRO AcMNPV polyhedrin gene" 
0. .0 

/note="MCS 

BamHI - Smal -Xbal -EcoRI -Not I -anal Il-Pstl- 
0. .0 

/note="ORI E . coli pMBl (ColEl and 
complement ( 0 . . 0 ) 

/note="ANT E. coli beta- lactamase gene 

ampicillin resistance gene (apr/amp) " 

S3 Sequence 9632 BP; 2602 A; 2122 C; 2176 G; 2732 T; 0 
other * 

kagctttact cgtaaagcga gttgaaggat catatttagt tgcgtttatg 
agataagatt gaaagcacgt gtaaaatgtt tcccgcgcgt tggcacaact 
atttacaatg cggccaagtt ataaaagatt ctaatctgat atgttttaaa 
acacctttgc ggcccgagtt gtttgcgtac gtgactagcg aagaagatgt 
gtggaccgca gaacagatag taaaacaaaa ccctagtatt ggagcaataa 
tcgatttaac caacacgtct aaatattatg atggtgtgca ttttttgcgg 
gcgggcctgt tatacaaaaa aattcaagta cctggccaga ctttgccgcc 
tgaaagcata gttcaagaat ttattgacac ggtaaaagaa tttacagaaa 
agtgtcccgg catgttggtg ggcgtgcact gcacacacgg tattaatcgc 
accggttaca tggtgtgcag atatttaatg cacaccctgg gtattgcgcc 
gcaggaagcc atagatagat tcgaaaaagc cagaggtcac aaaattgaaa 
gacaaaatta cgttcaagat ttattaattt aattaatatt atttgcattc 
tttaacaaat actttatcct attttcaaat tgttgcgctt cttccagcga 
accaaaacta tgcttcgctt gctccgttta gcttgtagcc gatcagtggc 
gttgttccaa tcgacggtag gattaggccg gatattctcc accacaatgt 
tggcaacgtt gatgttacgt ttatgctttt ggttttccac gtacgtcttt 
tggccggtaa tagccgtaaa cgtagtgccg tcgcgcgtca cgcacaacac 
cggatgtttg cgcttgtccg cggggtattg aaccgcgcga tccgacaaat 
ccaccacttt ggcaactaaa tcggtgacct gcgcgtcttt tttctgcatt 
atttcgtctt tcttttgcat ggtttcctgg aagccggtgt acatgcggtt 
tagatcagtc atgacgcgcg tgacctgcaa atctttggcc tcgatctgct 
tgtccttgat ggcaacgatg cgttcaataa actcttgttt tttaacaagt 
tcctcggttt tttgcgccac caccgcttgc agcgcgtttg tgtgctcggt 
gaatgtcgca atcagcttag tcaccaactg tttgctctcc tcctcccgtt 
gtttgatcgc gggatcgtac ttgccggtgc agagcacttg aggaattact 
tcttctaaaa gccattcttg taattctatg gcgtaaggca atttggactt 



FT 


misc.feature 


FT 




polyhedrin gene 


FT 




FT 


transposon 


FT 




FT 


misc_binding 


FT 




FT 


misc__ binding 


FT 




FT 


mi s c_b i nd ing 


FT 




FT 


promoter 


FT 




FT 


mi s c_b i nd ing 


FT 




FT 




Bglll" 


FT 


rep„origm 


FT 




pBR322) " 


FT 


CDS 


FT 




(bla) 


FT 
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cataatcagc tgaatcacgc cggatttagt aatgagcact gtatgcggct 
gcaaatacag cgggtcgccc cttttcacga cgctgttaga ggtagggccc 
ccattttgga tggtctgctc aaataacgat ttgtatttat tgtctacatg 
aacacgtata gctttatcac aaactgtata ttttaaactg ttagcgacgt 
ccttggccac gaaccggacc tgttggtcgc gctctagcac gtaccgcagg 
ttgaacgtat cttctccaaa tttaaattct ccaattttaa cgcgagccat 
tttgatacac gtgtgtcgat tttgcaacaa ctattgtttt ttaacgcaaa 
ctaaacttat tgtggtaagc aataattaaa tatgggggaa catgcgccgc 
tacaacactc gtcgttatga acgcagacgg cgccggtctc ggcgcaagcg 
gctaaaacgt gttgcgcgtt caacgcggca aacatcgcaa aagccaatag 
tacagttttg atttgcatat taacggcgat tttttaaatt atcttattta 
ataaatagtt atgacgccta caactccccg cccgcgttga ctcgctgcac 
ctcgagcagt tcgttgacgc cttcctccgt gtggccgaac acgtcgagcg 
ggtggtcgat gaccagcggc gtgccgcacg cgacgcacaa gtatctgtac 
accgaatgat cgtcgggcga aggcacgtcg gcctccaagt ggcaatattg 
gcaaattcqa aaatatatac agttgggttg tttgcgcata tctatcgtgg 
cgttgggcat gtacgtccga acgttgattt gcatgcaagc cgaaattaaa 
tcattgcgat tagtgcgatt aaaacgttgt acatcctcgc ttttaatcat 
gccgtcgatt aaatcgcgca atcgagtcaa gtgatcaaag tgtggaataa 
tgttttcttt gtattcccga gtcaagcgca gcgcgtattt taacaaacta 
gccatcttgt aagttagttt catttaatgc aactttatcc aataatatat 
tatgtatcgc acgtcaagaa ttaacaatgc gcccgttgtc gcatctcaac 
acgactatga tagagatcaa ataaagcgcg aattaaatag cttgcgacgc 
aacgtgcacg atctgtgcac gcgttccggc acgagctttg attgtaataa 
gtttttacga agcgatgaca tgacccccgt agtgacaacg atcacgccca 
aaagaactgc cgactacaaa attaccgagt atgtcggtga cgttaaaact 
attaagccat ccaatcgacc gttagtcgaa tcaggaccgc tggtgcgaga 
agccgcgaag tatggcgaat gcatcgtata acgtgtggag tccgctcatt 
agagcgtcat "gtttagacaa gaaagctaca tatttaattg atcccgatga 
ttttattgat aaattgaccc taactccata cacggtattc tacaatggcg 
gggttttggt caaaatttcc ggactgcgat tgtacatgct gttaacggct 
ccgcccacta ttaatgaaat taaaaattcc aattttaaaa aacgcagcaa 
gagaaacatt tgtatgaaag aatgcgtaga aggaaagaaa aatgtcgtcg 
acatgctgaa caacaagatt aatatgcctc cgtgtataaa aaaaatattg 
aacgatttga aagaaaacaa tgtaccgcgc ggcggtatgt acaggaagag 
gtttatacta aactgttaca ttgcaaacgt ggtttcgtgt gccaagtgtg 
aaaaccgatg tttaatcaag gctctgacgc atttctacaa ccacgactcc 
aagtgtgtgg gtgaagtcat gcatctttta atcaaatccc aagatgtgta 
taaaccacca aactgccaaa aaatgaaaac tgtcgacaag ctctgtccgt 
ttgctggcaa ctgcaagggt ctcaatccta tttgtaatta ttgaataata 
aaacaattat aaatgctaaa tttgtttttt attaacgata caaaccaaac 
gcaacaagaa catttgtagt attatctata attgaaaacg cgtagttata 
atcgctgagg taatatttaa aatcattttc aaatgattca cagttaattt 
gcgacaatat aattttattt tcacataaac tagacgcctt gtcgtcttct 
tcttcgtatt ccttctcttt ttcatttttc tcctcataaa aattaacata 
gttattatcg tatccatata tgtatctatc gtatagagta aattttttgt 
tgtcataaat atatatgtct tttttaatgg ggtgtatagt accgctgcgc 
atagtttttc tgtaatttac aacagtgcta ttttctggta gttcttcgga 
gtgtgttgct ttaattatta aatttatata atcaatgaat ttgggatcgt 
cggttttgta caatatgttg ccggcatagt 

acgcagcttc ttctagttca attacaccat tttttagcag caccggatta 
acataacttt ccaaaatgtt gtacgaaccg ttaaacaaaa acagttcacc 
tcccttttct atactattgt ctgcgagcag ttgtttgttg ttaaaaataa 
cagccattgt aatgagacgc acaaactaat atcacaaact ggaaatgtct 
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ctgtcccgat ttatttgaaa cactacaaat taaaggcgag ctttcgtacc 
aacttgttag caatattatt agacagctgt gtgaagcgct caacgatttg 
cacaagcaca atttcataca caacgacata aaactcgaaa atgtcttata 
tttcgaagca cttgatcgcg tgtatgtttg cgattacgga ttgtgcaaac 
acgaaaactc acttagcgtg cacgacggca cgttggagta ttttagtccg 
gaaaaaattc gacacacaac tatgcacgtt tcgtttgact ggtacgcggc 
gtgttaacat acaagttgct aacgtaatca tggtcatagc tgtttcctgt 
gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 
taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt 
gcgttgcgct cactgcccgc tttccagtcg ggaaacctgt cgtgccagct 
gcattaatga atcggccaac gcgcggggag aggcggtttg cgtattgggc 
gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 
cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 
atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 
ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc 
ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 
cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg 
tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 
ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 
ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc 
aacccggtaa 

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga 
gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 
cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag 
ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc 
gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 
agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa 
aggatcttca cctagatcct tttaaattaa aaatgaagtt ttaaatcaat 
ctaaagtata tatgagtaaa cttggtctga cagttaccaa tgcttaatca 
gtgaggcacc tatctcagcg atctgtctat ttcgttcatc catagttgcc 
tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 
tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct 
gcaactttat ccgcctccat ccagtctatt aattgttgcc gggaagctag 
agtaagtagt tcgccagtta atagtttgcg caacgttgtt gccattgcta 
caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc attcagctcc 
ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 
cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc 
atgccatccg taagatgctt ttctgtgact ggtgagtact caaccaagtc 
attctgagaa tagtgtatgc ggcgaccgag ttgctcttgc ccggcgtcaa 
tacgggataa taccgcgcca catagcagaa ctttaaaagt gctcatcatt 
ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 
ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc 
gcaaaaaagg gaataagggc gacacggaaa tgttgaatac tcatactctt 
cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg 
gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggttccgcgc 
acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 
gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtctcgcgc 
gtttcggtga tgacggtgaa aacctctgac acatgcagct cccggagacg 
gtcacagctt gtctgtaagc ggatgccggg agcagacaag cccgtcaggg 
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PTOTJRE 1 (Cont'd) 

atcaatatat agttgctgat atcatggaga taattaaaat gataaccatc 
tcgcaaataa ataagtattt tactgttttc gtaacagttt tgtaataaaa 
aaacctataa atattccgga ttattcatac cgtcccacca tcgggcgcgg 
atcccgggta ccttctagaa ttccggagcg gccgctgcag atctgatcct 
ttcctgggac ccggcaagaa ccaaaaactc actctcttca aggaaatccg 
taatgttaaa cccgacacga tgaagcttgt cgttggatgg aaaggaaaag 
agttctacag ggaaacttgg acccgcttca tggaagacag cttccccatt 
gttaacgacc aagaagtgat ggatgttttc cttgttgtca acatgcgtcc 
cactagaccc aaccgttgtt acaaattcct ggcccaacac gctctgcgtt 
gcgaccccga ctatgtacct catgacgtga ttaggatcgt cgagccttca 
tgggtgggca gcaacaacga gtaccgcatc agcctggcta agaagggcgg 
cggctgccca ataatgaacc ttcactctga gtacaccaac tcgttcgaac 
agttcatcga tcgtgtcatc tgggagaact tctacaagcc catcgtttac 
atcggtaccg actctgctga agaggaggaa attctccttg aagtttccct 
ggtgttcaaa gtaaaggagt ttgcaccaga cgcacctctg ttcactggtc 
cggcgtatta aaacacgata cattgttatt agtacattta ttaagcgcta 
gattctgtgc gttgttgatt tacagacaat tgttgtacgt attttaataa 
ttcattaaat ttataatctt tagggtggta tgttagagcg aaaatcaaat 
gattttcagc gtctttatat ctgaatttaa atattaaatc ctcaatagat 
ttgtaaaata ggtttcgatt agtttcaaac aagggttgtt tttccgaacc 
gatggctgga ctatctaatg gattttcgct caacgccaca aaacttgcca 
aatcttgtag cagcaatcta gctttgtcga tattcgtttg tgttttgttt 
tgtaataaag gttcgacgtc gttcaaaata ttatgcgctt ttgtatttct 
ttcatcactg tcgttagtgt acaattgact cgacgtaaac acgttaaata 
aagcttggac atatttaaca tcgggcgtgt tagctttatt aggccgatta 
tcgtcgtcgt cccaaccctc gtcgttagaa gttgcttccg aagacgattt 
tgccatagcc acacgacgcc tattaattgt gtcggctaac acgtccgcga 
tcaaatttgt agttgagctt tttggaatta tttctgattg cgggcgtttt 
tgggcgggtt tcaatctaac tgtgcccgat tttaattcag acaacacgtt 
agaaagcgat ggtgcaggcg gtggtaacat ttcagacggc aaatctacta 
atggcggcgg tggtggagct gatgataaat ctaccatcgg tggaggcgca 
ggcggggctg gcggcggagg cggaggcgga ggtggtggcg gtgatgcaga 
cggcggttta ggctcaaatg tctctttagg caacacagtc ggcacctcaa 
ctattgtact ggtttcgggc gccgtttttg gtttgaccgg tctgagacga 
gtgcgatttt tttcgtttct aatagcttcc aacaattgtt gtctgtcgtc 
taaaggtgca gcgggttgag gttccgtcgg cattggtgga gcgggcggca 
attcagacat cgatggtggt ggtggtggtg gaggcgctgg aatgttaggc 
acgggagaag gtggtggcgg cggtgccgcc ggtataattt gttctggttt 
agtttgttcg cgcacgattg tgggcaccgg cgcaggcgcc gctggctgca 
caacggaagg tcgtctgctt cgaggcagcg cttggggtgg tggcaattca 
atattataat tggaatacaa atcgtaaaaa tctgctataa gcattgtaat 
ttcgctatcg tttaccgtgc cgatatttaa caaccgctca atgtaagcaa 
ttgtattgta aagagattgt ctcaagctcg ccgcacgccg ataacaagcc 
ttttcatttt tactacagca ttgtagtggc gagacacttc gctgtcgtcg 
acgtacatgt atgctttgtt gtcaaaaacg tcgttggcaa gctttaaaat 
atttaaaaga acatctctgt tcagcaccac tgtgttgtcg taaatgttgt 
ttttgataat ttgcgcttcc gcagtatcga cacgttcaaa aaattgatgc 
gcatcaattt tgttgttcct attattgaat aaataagatt gtacagattc 
atatctacga ttcgtcatgg ccaccacaaa tgctacgctg caaacgctgg 
tacaatttta cgaaaactgc aaaaacgtca aaactcggta taaaataatc 
aacgggcgct ttggcaaaat atctatttta tcgcacaagc ccactagcaa 
attgtatttg cagaaaacaa tttcggcgca caattttaac gctgacgaaa 
taaaagttca ccagttaatg agcgaccacc caaattttat aaaaatctat 
tttaatcacg gttccatcaa caaccaagtg atcgtgatgg actacattga 
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FTCIJRE 1 (Cont'd) 

cgcgtcagcg ggtgttggcg ggtgtcgggg ctggcttaac tatgcggcat 
cagagcagat tgtactgaga gtgcaccata tgcggtgtga aataccgcac 
agatgcgtaa ggagaaaata ccgcatcagg cgccattcgc cattcaggct 
gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg ctattacgcc 
agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca 
gggttttccc agtcacgacg ttgtaaaacg acggccagtg cc 
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FIGURE 2D 



10 20 30 40 50 

1(111 
1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
C<jCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCTTAAATCGAGAATGGTGCCAAATTTTAATGC 
AGCAGTGTCAAAAGAAACGAATTTAGCTCTTACCACGGTTTAAAATTACG 
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FIGURE 2D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTCATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 G AAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 C AG ATCTAGTCTAGTTTTAGCAGCGAC ATC AGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTG'GGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGG C TGAAC AACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTC1*TTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 C TCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GG ACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 3C 
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FIGURE 3D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TGGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATC AGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TC AACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

8 51 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTCGTCCGAAGCCACAGCAATTTTTTGGACTTATGAATGC 
AGCAGTGTCAAAGCAGGCTTCGGTGTCGTTAAAAAACCTGAATACTTACG 

951 TG ATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
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FIGURE 3D (CONT'D) 



ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

12 51 CAG ATCTAGTCTAGTTTTAGCAGCGAC ATCAGGGAACAGTGGTACCACAC 

GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTG^GCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTG AGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 4C 
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FIGURE 4D 

10 20 30 40 50 

111!) 
1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 'TJCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTXTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT " 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTTCTTTGCGTCCACTGGCATTGTGGCGAAGTTTTAATGC 
AGCAGTGTCAAAAGAAACGCAGGTGACCGTAACACCGCTTCAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
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FIGURE 4P (CONT'D) 

ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGC AAATC AGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATAC ACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTG ATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGAC AGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 5C 
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FIGURE 5D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCC AAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGAC ATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATA^TTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTCCGCAAGGAATTGCAGGGCAGCGAAATTTTAATGC 
AGCAGTGTCAAAAGAGGCGTTCCTTAACGTCCCGTCGCTTTAAAATTACG 
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FIGURE 5D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATAC AGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTAC AGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 01 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 

TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 C AGATCTAGTCTAGTTTTAGCAGCGAC ATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTAC AGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACAC AACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGC ATCCTCTGGCC AACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTG AGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGG TG ACC CAAACCAAATATGGTTACC ATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 6D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 AC AGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTGATGTGGATGAAAGGGATGTGAGGGAATTTGCTTCTTT 
AGCAGTGTCAAACTACACCTACTTTCCCTACACTCCCTTAAACGAAGAAA 

951 TTTAGCTGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTC 
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FIGURE 6D (CONT'D) 

AAATCGACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAG 

1001 G AAATGGTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAAC 
CTTTACCAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTG 

1051 GCAATACAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTG 
CGTTATGTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGAC 

1101 GACTTTGAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTA 
CTGAAACTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGAT 

1151 CTTACGGGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACT 
GAATGCCCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGA 

1201 GC TGC AACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACCATCAT 
CGACGTTGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTA 

1251 AAATCCC AGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTA 
TTTAGGGTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCAT 

13 01 CCACACTTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTT 
GGTGTGAATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAA 

13 51 CCTACTAATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGG 
GGATGATTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACC 

1401 TCTGTGCTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCA 
AGACACGAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGT 

1451 GTGAAAAGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGT 
CACTTTTCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCA 

1501 CCTCAGCAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGA 
GGAGTCGTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCT 

1551 AACAGTTGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGAT 
TTGTCAACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTA 

1601 GGATGTTCAAGAATGATGGAACCATTTTAAATTTGTAt AGTGGATTGGTG 
CCTACAAGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCAC 

1651 TTAGATGTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCC 
AATCTACACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGG 

1701 TCTCCATGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAG 
AGAGGTACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTC 

1751 ATTACTCTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAA 
TAATGAGAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTT 

1801 TAAAAAGGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCG 
ATTTTTCCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGC 

1851 AATTCCTGCAG 
TTAAGGACGTC 
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FIGURE 7C 
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FIGURE 7D 



10 20 30 40 50 

I 1 I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGAC ATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

2 51 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 

TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

8 01 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 

AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTCAGGTGGTTCAATTGCAGAATTATGATGAAGAGGATGC 

AGCAGTGTCAAAGTCCACCAAGTTAACGTCTTAATACTACTTCTCCTACG 
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FIGURE 7D (CONT'D) 

9 51 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGC AATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

115* GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATC ATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGAC AAGTATGGATAGAGGACTGTAGC AGTG AAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTC AG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGAC AGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



37/254 



FIGURE 8A 



primer Ricin-109Eco 



pAP-144 template 

primer MAL-B 1 



I I 



signal 

peptide A chain 



linker 



B chain 



primer 1729C Xba 



primer MAL-B 2 



i. 



PGR mutacenesis 



PCR mutagenesis 



^gel purify 

^EcoRI 
^ gel purify 




MAL-B linker 



pBluescript SK 



L 4-EcoRI 



-Xbal 



Signal 
eptide 
coRI 



LXbal 



pAP-225 



^EcoRl/ Xbal 
^ gel purify vector 



triple ligation 



38/254 



i 



PQ 

oo 

w 

o 

l-H 



C 

!2 

s 

a. 

a. 
H 



O 

a 

o 

a 
o 



3 



CO 

c 

o 

OX) 

5 

£ . 

O 
a* 



'C 



3: 
o 

OX) 



CM 

CO 



e 



*- 5 



Eh < 

< Eh 

O O 

Eh < 

< Eh 

< Eh 

(J o 

< Eh 

o u 

a o 

< Eh 

o o 

o cj 

o o 

Eh < 

< e» 

< Eh 

o o 

o o 

o a 

o CJ 

o o 

Eh < 

Eh < 

< Eh 

o o 

o o 

CJ o 
o o 

Eh < 

Eh < 



39/254 
FTGURE 8C 
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FIGURE 8D 



10 20 30 40 50 

I i I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

5 01 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTTGCCGATTTTCGGGGAATCGGAGGACAATGATGAAGC 
AGCAGTGTCAAAAACGGCTAAAAGCCCCTTAGCCTCCTGTTACTACTTCG 
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PTr.TTRB 8D (CONT'D) 



951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGG AGTCTATGTGATGATCTATGATTGCAATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 01 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 

TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATC AGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACC AACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGAC AAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTC AATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTAC AAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

18 51 TGCAG 
ACGTC 
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FIGURE 9C 




pAP-228 
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FTCTJRE 9D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 

acaccagccgatggcacgacctttatcgcgtataaagaaagtaggactgt 

401 atcaggaagatgcagaagcaatcactcatcttttcactgatgttcaaaat 
tagtccttctacgtcttcgttagtgagtagaaaaGtgactacaagtttta 

451 cgatatacattcgcctttggtggtaattatgatagacttgaacaacttgc 
gctatatgtaagcggaaaccaccattaatactatctgaacttgttgaacg 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTCAGGTGGTTACAGGGGAAGCGATATCAGTTACTATGGC 
AGCAGTGTCAAAGTCCACCAATGTCCCCTTCGCTATAGTCAATGATACCG 
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FTCURE 9D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACC ATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 C^GATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACC ACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTC AG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTAC AAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACC AAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE mr 
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figure inn 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 

CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

8 01 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 

AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGCTTTGGAGAGAACGTTCCTGTCGTTCCCTACTAATGC 
AGCAGTGTCAAACGAAACCTCTCTTGCAAGGACAGCAAGGGATGATTACG 
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FTHTTRF. 1QP (CONT^D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGC AAATATGGGATAATGG AACC ATC ATAAATCC 
TqACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACAC AACCTTTTGTTAC AACC ATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCC AAACCAAATATGGTTACC ATTATTTTGATAGAC AG ATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGC AAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 
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FIGURE 11 A 
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FIGURE 11C 
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FIGURE IIP 

10 20 30 40 50 

I I I i I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
^CGCCCACGGTGACACGTTTCGATG TG TTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTC AACC AAAC GGATATTTGG TTGC C AAAT AAAATC AACTTGAG AG T 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTC ACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATG ATAG ACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTC AGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATC ATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTAAATTCCAAGATATGCTAAATAATTCACAGC ATCAGGC 
AGCAGTGTCAAATTTAAGGTTCTATACGATTTATTAAGTGTCGTAGTCCG 
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FIGURE IIP (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCC ATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

10 01 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTfTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAAC ATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAAC AACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTC AG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACC AAATATGGTTACCATTATTTTGATAGAC AGATTACT 

ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

18 01 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 

CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



57/254 



FIGURE 12A 



primer Ricin-109Eco 



pAP- 144 template 

primer HSV-A1 



signal 
peptide 



A chain 



* inker B chain 



primer 1729CXba 



primer HSV1-A2 



PCR mutagenesis 



i 



PCR mutagenesis 



^ gel purify 

^EcoRI 

^ gel purify 





EcoRl 



Xbal 



,EcoRI/XbaI 
gel purify vector 



58/254 



en 
i 



O 
i— i 



U 



2 

u 
H 



I 

> 

CO 
K 

u 

CD 

e 

•H 

a 




do" b 

E-"< 5 



g 

a 



00 



8 a 



Cl, 



< 

I 

> 

CO 

JH- 
a; 
e 

u 



2? » 



< 
Eh 



Eh 
Eh 

O a 

H < 
< 

o a 

< 

v a 

o CJ 

a u 
u o 

Eh < 

o o 
a o 

Eh < 

< eh 
o o 
o a 
u o 

Eh < 

o a 

Eh <J 

e- < 
o a 
o u 
a a 
a u 

Eh < 

o o 

Eh < 



59/254 
FIGURE 12C 
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FIGURE 12D 



10 20 30 40 50 

I I I I 1 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 

AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGC ACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTTCTGCGCTTGTAAACGCATCGTCGGCACATGTTAATGC 
AGCAGTGTCAAAAGACGCGAACATTTGCGTAGCAGCCGTGTACAATTACG 
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FIGURE 12D (CONT'D) 

9 51 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 01 ACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACCATCATAAATCC 

TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGC AGCGAC ATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACAC AACCTTTTGTTAC AACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGC AAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTGCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTC AG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACC AAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGC AAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 13C 
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FTGURE 13D 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCjpTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTGACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGC ACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTTCTACGTATTTACAGGCATCGGAGAAATTTAAGAATGC 
AGCAGTGTCAAAAGATGCATAAATGTCCGTAGCCTCTTTAAATTCTTACG 
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FTGURE 13D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGAC AATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGC AGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACAC AACCTTTTGTTAC AACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGC AGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGC ATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCC ATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 14C 
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FIGURE 14D 



10 20 30 40 50 

I I 1 I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGAC ATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 XCAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCC ACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTTCTCAGGATGTAAACGCAGTGGAGGCAAGTTCTAATGC 
AGCAGTGTCAAAAGAGTCCTACATTTGCGTCACCTCCGTTCAAGATTACG 
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FTGURE 14D (CQNrD) 



951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGOTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCC ACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTAC AGTGCAAACCAACATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGAC AAGTATGGATAGAGGACTGTAGC AGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTQCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACC AAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCC ATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 15C 
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FIGURE 15D 



10 20 30 40 50 

! I i I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTAC ACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATC ATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

9 01 TCGTCACAGTTTTCTGTGTATTTACAGGCATCGACGGGATATGGTAATGC 
AGCAGTGTCAAAAGACACATAAATGTCCGTAGCTGCCCTATACCATTACG 
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FTGURE 15D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCC ACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGC AAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGC AATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
yTGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACC AACATTTATGCCGTT AGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACC ATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGC AAATAGTGGACAAGTATGGATAGAGGACTGTAGC AGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTG AACAACAGTGGGCTCTTTATGC AGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTG ATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGC ATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACC AAATATGGTTACC ATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

18 51 TGCAG 
ACGTC 
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FIGURE 16A 

PCR Mutagenesis of Preproricin Gene to Create an EBV-A Variant 
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FIGURE 16C 
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FTHIIRE 16D 

10 20 30 40 50 

I ! I I t 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACC AAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TC AACTGCAAAGACGTAATGGTTCCAAATTC AGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATC ATAGCTCTCATGGTGTATAGATGCGCACCTCCACC A 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTAAGCTTGTACAGGCATCGGCGTCAGGTGTTAATGC 
AGCAGTGTCAAAAGATTCGAACATGTCCGTAGCCGCAGTCCACAATTACG 
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FIGURE 16D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGC AAATC AGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCG ATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 XCTGATGCCACCCGCTGGCAAATATGGGATAATGGAACC ATC ATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACC ATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGA^GACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAAC AACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGG ATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAG ATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGAC AGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FTCURE 17A 
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FIGURE 17C 
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FIGURE 17D 

10 20 30 40 50 

I I 1 II 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAAC AACATATTCCCCAAAC AATACCC AATTATAAACTTT ACC ACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGC AAAGCTAC ACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGC ATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGC AATTC AAG AGTCTAACCAAGG AGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTCGTATCTAAAGGC ATCGGACGCACCTGATAATGC 
AGCAGTGTCAAAAGAAGCATAGATTTCCGTAGCCTGCGTGGACTATTACG 
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FIGURE 17D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 01 ACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACCATCATAAATCC 

^pACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACC AACATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGG ATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGC ATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACC AAATATGGTTACC ATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 18 A 
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FIGURE 18C 
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FIGURE 18D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCAC A 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTAT^AAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTC AAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAAC AACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

8 51 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTGGGGTTGTAAATGCATCGTGTAGACTTGCTAATGC 
AGCAGTGTCAAAAGACCCCAACATTTACGTAGCACATCTGAACGATTACG 



91/254 



FIGURE 18D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 19A 
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FIGURE igr 
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FIGURE 19D 



10 20 30 40 50 

I I ! I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTC ACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTAC ACAAACTTTATCAGAGCTGTTCGCGG 
w CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACC AACGGTTTATTTTAGTTGAACTCTC A 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGC AATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGC AAAG ACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGC ACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTCGTATGTAAAGGCATCGGTGTCACCTGAAAATGC 
AGCAGTGTCAAAAGAAGCATACATTTCCGTAGCCACAGTGGACTTTTACG 
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FIGURE 19D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGC AAGTCTAATACAGATGC AAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGC AAATATGGGATAATGGAACC ATC ATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATC AGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACC AACATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGAC AAGT ATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCC A 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGAC ATTGTAAATTTTGTAACTGAAAGGAC AGC AAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 20 A 
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FIGURE 20C 
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FIGURE 20D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CC3CCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAAC AACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACC AGATCCTAGCGTAATTACACTTG AG AATAGTTGGGGGAG A 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

8 51 TATTAATCCCTATC ATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTC ACAGTTTTCTTCGATTTTAAATGCATCGGTGCCAAATTTTAATGC 
AGCAGTGTCAAAAGAAGCTAAAATTTACGTAGCCACGGTTTAAAATTACG 
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FIGURE 20D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GQTACAGTCCGGG AGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCG ACATCAGGGAACAGTGGTACCAC AC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACC AAC ATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGG AC AAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCC AACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 21 



Ricin linker (wild type) : 

A chain- SLLIRPVVPNFN-B chain 

pAP-213/pAP-214 linker (Cathepsin B) : 

A chain- SLLKSRMVPNFN-B chain 

pAP-215/pAP-216 linker (MMP-3) : 

A chain- RPKPQQFFGLMN-B chain 

pAP-217/pAP-218 linker (MMP-7) : 

A chain- SLRPLALWRSFN-B chain 

pAP-219/pAP-220 linker (MMP-9) : 

A chain- SPQGIAGQRNFN-B chain 

pAP-22l/pAP-222 linker (THERMOLYS IN-LIKE MMP.) : 

A chain- DVDERDVRGFASFL-B chain 

pAP-24l/pAP-242 linker (EBV-A) : 

A chain- SKLVQASASGVN-B chain 

pAP-243/pAP-244 linker (EBV-B) : 

A chain- SSYLKASDAPDN-B chain 
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FIGURE 22A 
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FIGURE 22C 
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FIGURE 22D 



10 20 30 40 50 

I I I I 1 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCC AAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGC AATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATC ATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTAAGTATCTACAGGCAAATGAGGTAATTACTAATGC 
AGCAGTGTCAAAAGATTCATAGATGTCCGTTTACTCCATTAATGATTACG 
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FIGURE 22D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TfeACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAQGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAAC AACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GT^TTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACG ATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACC ATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCC A 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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PTHITRE 23C 
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FIGURE 23P 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTf CACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTGAGCTTAGAACGCAATCGTTCTCAAATTGGAATGC 
AGCAGTGTCAAAAGACTCGAATCTTGCGTTAGCAAGAGTTTAACCTTACG 
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FIGURE 23D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCAC AACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATAC AGATGCAAATC AGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTAC AACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGC AAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGC AGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTC AATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAG ACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

18 51 TGCAG 
ACGTC 
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FIGURE 24C 
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FIGURE 24D 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTfCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAG AAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTC TTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGG A 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTGAGCTTTGGTCGCAAGGGATCGATGATGATAATGC 
AGCAGTGTCAAAAGACTCGAAACCAGCGTTCCCTAGCTACTACTATTACG 
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PTf^TTHF. 24D f f.QNTD) 

951 TGATGTTTGTATGG ATCCTGAGCCCATAGTGCGTATCGTAGGTCG AAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGC AAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACC ATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTAC AGTGC AAACC AACATTTATGCCGTTAGTC AAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGC AGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAAC AGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAAC AGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATC ATTCTTTACCCTCTCC A 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGC AGTGTGTGTGTCCTGCCATGAAAATAG ATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 25A 
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I I 
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PTCTITRTl 25C 
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FIGURE 25D 



10 20 30 40 50 

I I I I I 

1 GAATTCATGA?IACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTC A 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TCGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCC ACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCC AAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCC AAATTC AGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTAAGCCTGCAAAGTTCTTCAGGCTAAATTTTAATGC 
AGCAGTGTCAAAAGATTCGGACGTTTCAAGAAGTCCGATTTAAAATTACG 
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FIGURE 25D (CONT'D) 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAA*TCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGC AAGTCTAATAC AGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGC AATACTGCTGC A 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACGCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGAC AAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCC AACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTC C TGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 26 



Ricin linker (wild type) : 

A chain- SLLIRPVVPNFN-B chain 

pAP-223/224 linker (MAL-A) : 

A chain- QVVQLQNYDEED-B chain 

pAP-225/226 linker (MAL-B) : 

A chain- LPIFGESEDNDE-B chain 

pAP-227/228 linker (MAL-C) : 

A chain- QVVTGEAISVTM-B chain 

pAP-229/230 linker (MAL-D) : 

A chain- ALERTFLSFPTN-B chain 



pAP-23l/pAP-232 linker (MAL-E) : 

A chain- KFQDMLNISQHQ-B chain 
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FTCURE 27 



Ricin linker (wild type) : 

A chain- SLLIRPVVPNFN-B chain 



pAP-245/246 linker (CMV-A) : 

A chain- SGVVNASCRLAN-B chain 



pAP-247/248 linker (CMV-B) : 

A chain- SSYVKASVSPEN-B chain 



pAP-233/234 linker (HERPES SIMPLEX-1 A) : 

A chain- SALVNASSAHVN-B chain 



pAP-235/236 linker (HERPES SIMPLEX-1 B) : 

A chain- S T Y L Q-A S E K F K N -B chain 

pAP-249/250 linker (HUMAN HERPES VIRUS-6) : 

A chain- SSILNASVPNFN-B chain 



pAP-237/pAP-23 8 linker (VZV-A) : 

A chain- SQDVNAVEASSN-B chain 



pAP-239/pAP-240 linker (VZV-B) : 

A chain- SVYLQASTGYGN-B chain 



pAP-253/pAP-254 linker (ILV) : 

A chain- SKYLQANEVITN-B chain 



pAP-255/pAP-256 linker (HAV-A) : 

A chain- SELRTQSFSNWN-B chain 



pAP-257/pAP-258 linker (HAV-B) : 

A chain- SELWSQGIDDDN-B chain 
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FIGURE 28 

Ricin linger (wild type) : 

A chain- SLLIRPVVPNFN-B chain 

pAP-259/260 linker (CAP- A) : 

A chain- SKPAKFFRLNFN-B chain 

pAP-261/262 linker (CAP-B) : 

A chain- SKPIEFFRLNFN-B chain 



pAP-263/264 linker (CAP-C) : 

A chain- SKPAEFFALNFN-B chain 
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FIGURE 29 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 
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FTCURE 30A 

PCR Mutagenesis of Prcproricin Gene to Create An HCV- A Variant 
Gene in Bacnlovirns Transfer Vector, pVL 1393 

a) Cloning Strategy 



pAP-144 template 
(cut with Eco RI) 



primer Ricin-109Eco 



I I 



signal 
peptide 



HCV-A1 3' primer 



A chain 



WT linker 



HCV-A2 5' primer 
PCR mutagenesis 



gel purify 

EcoRI 
gel purify 



B chain 



primer 1729C Pst I 



PCR mutagenesis 




A chain 




EcoRI 



PstI 



dephosphorylate 
gel purify vector 
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FTCTJRE 30C (PI) 



Sequence of pAP262 insert 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

4 51 CG AT AT AC AT T C GCC T T T GGT GGT AAT TAT GAT AG ACT T GAAC AAC T T GC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGT AAT CT GAG AG AAAAT AT CGAGT TGGGAAAT GGT CC AC T AG AGG AGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTT.CAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 AT T CC AAT AT AT T G AGGG AGAAAT GCGC ACGAGAAT T AGGT AC AACC GGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 
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FIGURE 30C (P2) 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGATTTGGAGGTAGTGACATCGACATGGGTTTTTAATGC 
AGCAGTGTCAAACTAAAGCTCCATCACTGTAGCTGTACCCAAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACT GATGC CACCCGCT GGCAAAT AT GGGAT AATGGAACC AT C AT AAAT CC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 T T AC AGT GCAAACC AAC ATT TAT GCCGTT AGT CAAGGT T GGC T T CC T AC T 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 C T T GC AAGCAAAT AGT GGACAAGT AT GG AT AGAGG ACT GTAGC AGT G AAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

14 51 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 
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FIGURE 30C (P3) 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 T GGT G ACCC AAAC C AAAT AT G GT T ACC AT TAT T T T GAT AG AC AG AT T AC T 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP262 
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FTCitJRE 30D 

-Amino Acid Sequence Comparison of Mutant 
Preproricin Linker Region of HCV-A to Wild Type 

Wild type Ricin linker: ' A chain- SLLIRPVVPNFN-B chain 

pAP-262 linker: A chain- DLEVVTSTWVFN -B chain 

(HCV-A linker) 
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FIGURE 31 A 

PCR Mutagenesis of Prcproricin Gene to Create An HCV-B Variant 
Gene in Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 
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FIGURE 31C (PI) 

Sequence of pAP264 insert 

10 20 30 40 50 

1 1 I < I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 AT C AGG AAG AT GC AGAAGC AAT C AC T CAT C T T T T C AC T GAT G T T C AAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTT AT AAT TTGCATCCAAATGATTTC AGAAGC AGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 
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FIGURE 31C (P2) 



801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGATGAGATGGAAGAGTGTGCGTCACACCTTTTTAATGC 
AGCAGTGTCAAACTACTCTACCTTCTCACACGCAGTGTGGAAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 
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FTHTIRF. 31C (P3) 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGAC ATTGT AAAT TT TGTAACTGAAAGGAC AGCAAGT T AT ATCGAAT T CC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP264 
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FIGURE 31 n 

•Amino Acid Sequence Comparison of Mutant 
Preproricin Linker Region of HCV-B to Wild Type 



Wild type Ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-264 linker: A chain- DEMEECASHLFN-B chain 

(HCV-B linker) 
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FIGURE 32A 

- PCR Mutagenesis of Preproricin Gene to Create An HCV-C Variant 
Gene in Bacnlovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



pAP-144 template 
(cut with Eco RI) 



primer Ricin-109Eco 



1 I 



HCV-C1 3' primer 



signal 

peptide A chain 



WT linker 



HCV-C2 5* primer 
PCR mutagenesis 



^gei purify 

^ EcoRI 
^ gel purify 



B chain 



primer 1729CPstI 



PCR mutagenesis 




A chain 




EcoRI 



PstI 



signal 
sptide 
coRI 




iPstl 



pAP 266 



EcoRI/ Pst I 

dephosphorylate 
gel purify vector 
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FTCLTTRF 32C (Fl) 



Sequence of pAP266 insert 



10 20 30 40 50 

I I I I i 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 AT C AGGAAGATGC AG AAGC AAT CACT C AT CT TT T CACT GAT GTT C AAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACT AGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGT ACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 
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FTHIJRE 32C (P2) 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

8 51 TATTAATCCCTATCATAGCTCTCATGGTGTAT AGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGAGGACGTTGTATGTTGTTCGATGTCATATTTTAATGC 
AGCAGTGTCAAACTCCTGCAACATACAACAAGCTACAGTATAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGAJACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCAT AAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 
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FIGURE 32C (P3) 



1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855 • 
Sequence name: PAP266 
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FIGURE 32D 



-Amino Acid Sequence Comparison of Mutant 
Preproricin Linker Region 6f HCV-C to Wild Type 



Wild type Ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-266 linker: A chain- EDVVCCSMSYFN-B chain 

(HCV-C linker)" 
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FTHURE 33A 

PGR Mutagenesis of Preproricin Gene to Create An HCV-D Variant 
Gene in Baculo virus Transfer Vector, pVL 1393 



a) Cloning Strategy 



pAP-144 template 
(cut with EcoRI) 



primer Ricin-109Eco 



I C 



A chain 



HCV-D1 3' primer 



signal . 
peptide A cnain 



WT linker B chain 

HCV-D2 5' primer 
PCR mutagenesis |pCR mutagenesis 



primer 1729CPstI 



gel purify 

EcoRI 
gel purify 




EcoRI 




dephosphorylate 
gel purify vector 
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FTCTTTRF. 33C (PI) 

Sequence of pAP268 insert 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 AT T C C AAT AT AT T GAGGGAG AAAT GC GC ACG AG AAT T AG GT AC AAC C G G A 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

7 51 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 
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pmTTKF 33C (P2) 



801 TCAACTGCAAAGACGT AATGGTTCCAAATTCAGTGTGTACGATGTGAGT A 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTAAGGGGTGGAGATTGCTAGCGCCAATAACTGCTTATGC 
AGCAGTGTCAAATTCCCCACCTCTAACGATCGCGGTTATTGACGAATACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTT TGGT TGT AAAT ACGGCAATC AGTT CC AACCGAAGGAT GA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACC AT TTT AAAT TTGT AT AGTGGAT TGGT GT T AGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 
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FIGURE 33C (P3) 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP268 
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FIGURE 33D 

-Amino Acid Sequence Comparison of Mutant 
Preproricin Linker Region of HCV-D to Wild Type 



Wild type Ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-268 linker: A chain- KGWRLLAPITAY-B chain 

(HCV-D linker) 
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FTGURE 34A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

270-3' primer 



I I 



signal 
peptide 



A chain 



linker 

H 



270-5' primer 



1 



PCR mutagenesis 



^ gel purify 
^ EcoRI 

i 1 



gel purify 



B chain 



primer 1729C PstI 



PCR mutagenesis 




A chain 




EcoRI 



PsO 



dephosphorylate 
gel purify vector 
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FIGURE 34B 

Sequence of MMP-2 Linker Region 

WT preprocin linker 

primer 270-3' 

5 ' - TGGGCTC CTAATTTTAATGCTGATGTTTGT 
| ** ** ★ 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT | CACCATGGTTTAAAATTA 

*** * * ★** 
AGCAGTGTCAAAAGAAACGGGGACCCAAAT - 5 ' 
primer 270-5 ' 

1 ) PCR mutagenesis 

2) Ligate with pVLl 393 



pAP 270 linker 
(MMP-2 variant) 

TCTTTGCCCCTGGGTTTA | TGGGCTCCTAATTTTAAT - 
• AGAAACGGGGACC CAAAT | ACCCGAGGATTAAAATTA 
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FTGURE 34C (PI) 



Sequence of pAP270 insert 



10 20 30 40 50 

I I i I i 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 

AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 

ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 AT C AGGAAGATGCAGAAG CAAT C ACT CAT CT TTT C ACTGATGT T C AAAAT 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

4 51 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

6 01 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

6 51 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FIGURE 34C ( P9) 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 

CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

8 01 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 

AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAAT CC CTAT CATAGCTCTCATGGTGTATAGATGCGCAC CTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCCCCTGGGTTTATGGGCTCCTAATTTTAATGC 
AGCAGTGTCAAAAGAAACGGGGACCCAAATACCCGAGGATTAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FIGURE 34C (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP2 7 0 
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FIGURE 34D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of MMP-2 to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-270 (MMP-2) linker: A chain- SLPLGLWAPNFN-B chain 
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FIGURE 35A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 
a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

272-3' primer 



signal 
peptide 



A chain 



linker 



272-5' primer 



PCR mutagenesis 



^gel purify 
^EcoRI 



gel purify 



B chain 



primer 1729C PstI 



|PCR mutagenesis 




pAP272 Cathepsin B (Site 2) New linker 



A chain. 



Signal 
eptide 

m 



pVL 1393 + new linker 



EcoRI 




dephosphorylate 
gel purify vector 
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FIGURE 35B 

Sequence of Cathepsin B (Site 2) Linker Region 
WT preprocin linker 

primer 272-3' 

5 ' - AGGATGCCAAATTTTAATGCTGATGTTTGT 
| ** * * 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT CACCATGGTTTAAAATTA 



- AGCAGTGTCAAAAGAAACGAATATCGATCT - 5 ' 
primer 272-5' 



1) PCR mutagenesis 

2) Ligate with pVL1393 



pAP 272 linker 
(Cathepsin B Site 2 variant) 

-TCTTTGCTTATAGCTAGA | AGGATGCCTAATTTTAAT - 
- AGAAACGAAT AT CGATCT TCCTACGGATTAAAATTA 
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FIGURE 35C (PI) 

Sequence of pAP272 insert 



10 20 30 40 50 

I I I 1 I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGG^TGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 

AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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pmTTRF. 35C (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCTTATAGCTAGAAGGATGCCTAATTTTAATGC 
AGCAGTGTCAAAAGAAAGGAATATCGATCTTCCTACGGATTAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTGAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FTdtJRE 35C (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAAGGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

f 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
AC CACTGGGTTTGGTTTATACCAATGGTAAT AAAACTAT CTGT CTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP272 
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FIGURE 35D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Cathepsin B Site 2 to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP- 272 (Cathepsin B 2) linker: A chain- SLLIARRMPNFN-B chain 
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FIGURE 36A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 



primer Ricin-109Eco 



pAP- 144 template 
(cut with Eco RI) 

274-3' primer 



signal 
peptide 



A chain 



linker 



274-5* primer 



i 



PCR mutagenesis 



^ gel purify 
^EcoRI 



gel purify 



B chain 



17^ 



primer 1729CPstI 



i 



PCR mutagenesis 




pAP274 Cathepsin LNew linker 




EcoRI 



PstI 



EcoRl/PstI 

dephosphorylate 
gel purify vector 
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FTC:iTRE 36B 

Sequence of Cathepsin L Linker Region 
WT preprocin linker 

primer 274-3' 

5 ' - TCATGGGCTAATTTTAATGCTGATGTTTGT 
^ | ******* * 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT j CACCATGGTTTAAAATTA 

*** ** 

AGCAGTGTCAAAAGAAACGAATATAAGGCC - 5 ' 
primer 274-5' 



1) PCR mutagenesis 

2) Ligate with pVLl 393 



pAP 274 linker 
(Cathepsin L variant) 

TCTTTGCTTATATTCCGG | TCATGGGCTAATTTTAAT 
AGAAACGAATATAAGGCC | AGTACCCGATTAAAATTA 
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FTCIIRF, 36C (PI) 



Sequence of pAP274 insert 



10 20 30 40 50 

I I I I 1 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCC^AACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGC3TGCCACTGTGC^AGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

6 01 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

6 51 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FIGURE 36C (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCTTATATTCCGGTCATGGGCTAATTTTAATGC 
AGCA6TGTCAAAAGAAAGGAATATAAGGCCAGTACCCGATTAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTQGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TT ACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACGATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FTGURE 36C (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTC ACACACACAGGACGGTACTTTTAT CTAC CGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases 
Sequence name: PAP274 



1855 . 
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FIGURE 36D 



Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Cathepsin L to Wild Type 

Wild type ricin linker: , A chain- SLLIRPVVPNFN-B chain 
pAP-274 (Cathepsin L) linker : A chain- SLLIFRSWANFN-B chain 
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FIGURE 37A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 

primer Ricin-109Eco 



i r 



pAP- 144 template 
(cut with Eco RI) 

276-3' primer 



signal 

peptide A chain 



linker 



276-5 1 primer 



PCR mutagenesis 



^ gel purify 

^EcoRI 

^ gel purify 



B chain 



primer 1729C PstI 



PCR mutagenesis 




pAP276 Cathepsin D New linker 




EcoRI 



Psa 



dephosphorylate 
gel purify vector 
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FIGURE 37B 



Sequence of Cathepsin D Linker Region 
WT preprocin linker 

' primer 276-3' 

5 ' - ACTGTTATTGTTATCACCGCTGATGTTTGT 
j*** ****** * * ** 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGATAACGAATATTCCGG | CACCATGGTTTAAAATTA 

**** * * *** * 
AGCAGTGTCAAAAGACCACAACAGTAGCGA -5' 
primer 276-5' 



1 ) PCR mutagenesis 

2) Ligate with pVLl 393 



pAP 276 linker 
(Cathepsin D variant) 

TCTGGTGTTGTCATCGCT | ACTGTTATTGTTATCACC 
AGACCACAACAGTAGCGA j TGACAATAACAATAGTGG 
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FTCIJRE 37C (Fl) 



Sequence of pAP276 insert 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTtTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACAC CAGC CGATGGGACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTAT CT C AGCGCTTTATTATTACAGT ACTGGTGGCACTCAGCTT CCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTGCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FTGURE 37C (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTGGTGTTGTCATCGCTACTGTTATTGTTATCACCGC 
AGCAGTGTCAAAAGACCACAACAGTAGCGATGACAATAACAATAGTGGCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGC CATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGT CCGGGAGT CTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FTGURE 37C (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 C AAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name : PAP276 
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FIGURE 37P 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Cathepsin D to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-276 (Cathepsin D) linker: A chain- SGVVIATVIVIT-B chain 
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FIGURE 38A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP- 144 template 
(cut with Eco RI) 

278-3* primer 



I 



signal 
peptide 



A chain 



linker 



278-5' primer 



i 



PCR mutagenesis 



^gel purify 
^EcoRI 



gel purify 



B chain 



primer 1729C Pst I 



PCR mutagenesis 




pAP278 MMP1 Interstitial Collagenase New Linker 



A chain 




EcoRI 



PstI 



dephosphorylate 
gel purify vector 
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FIGURE 38B 

Sequence of MMP-1 (Interstitial collagenase) Linker Region 

WT preprocin linker 

primer 278-3' 

5 ' - ATTTGGGGACAGTTTAATGCTGATGTTTGT 
* ****** * * 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT | CACCATGGTTTAAAATTA 

** ********* 

AGCAGTGTCAAAAGAAACCCAGGAGTTCCG - 5 ' 
primer 278-5' 



1 ) PCR mutagenesis 

2) Ligate with pVLl 393 



pAP 278 linker 
(MMP-1 variant) 

TCTTTGGGTCCTCAAGGC | ATTTGGGGACAGTTTAAT- 

AGAAACC C AGGAGTTCCG j TAAACCCCTGTCAAATTA 
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FTCIJRE 38C (PI) 



Sequence of pAP278 insert 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
C CGTTGT AC CGAAACAAAACCTAGGTGGAGT CCC ACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TT AGT ACGT C T CGAAAGAC AATGT AAT CGCGAC CT ACAGTGGTT ACGT AT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 

ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTAGAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FIGURE 38C (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGGGTCCTCAAGGCATTTGGGGACAGTTTAATGC 
AGCAGTGTCAAAAGAAACGCAGGAGTTCCGTAAACCCCTGTCAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 

TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGC AAAT AGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FIGURE 38C (P3) 

1451 AGGCTGAAC AACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTT CTTACT AC CTTGGT AAAATTTAAACATAT CACCTAACCAC AAT CT A 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP2 78 
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FIGURE 38D 

Figure 38. d) Amino acid sequence Comparison of Mutant Preproricin Linker 
region of MMP-1 (Interstitial collagenase) to Wild Type 

Wild type ricin linker: ' A chain- SLLIRPVVPNFN-B chain 
pAP-278 (MMP-1) linker: A chain- SLGPQGIWGQFN-B chain 
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FIGURE 39A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 

primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

280-3* primer 



signal 
peptide 



A chain 



* PCR mutagenesis 



linker 



B chain 



280-5* primer 



primer 1729C Pst I 



i 



t 



PCR mutagenesis 



gel purify 

EcoRI 

gel purify 




pAP280 Urokinase-Type Plasminogen Activator New Linker 



A chain 





EcoRI 



PstI 



Pst I 



EcoRI/ PstT 

dephosphorylate 
gel purify vector 
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FIGURE 39B 

Sequence of Urokinase-Type Plasminogen Activator Linker Region 
WT preprocin linker 

primer 280-3' 

5'- GTTGTCGGTGGCTCTGTAGCTGATGTTTGT -3 
* ******* * *** 

- TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT | CACCATGGTTTAAAATTA 

************* *** 

3 ' - AGCAGTGTCAAATTTTTTAGGGGACCTTCT - 5 ' 
* primer 280-5' 



1 ) PCR mutagenesis 

2) LigatewithpVL1393 



pAP 280 linker 
(uPA variant) 

AAAAAATCCCCTGGAAGA | GTTGTCGGTGGCTCTGTA 
TTTTTTAGGGGACCTTCT | CAACAGCCACCGAGACAT 
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FIGURE 39C (PI) 



Sequence of pAP280 insert 



10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTAC7VTACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 

AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCA?y\TAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FIGURE 39C (P2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTAAAAAATCCCCTGGAAGAGTTGTCGGTGGCTCTGTAGC 
AGCAGTGTCAAATTTTTTAGGGGACCTTCTCAACAGCCACCGAGACATCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACGATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGC AAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FIGURE 39C (P3) 



1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTC3TCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
^ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

16 01 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases 
Sequence name: PAP2 8 0 



1855 . 
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FIGURE 3QD 

Figure 39. d) Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Urokinase-Type Plasminogen Activator to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-280 (uPA) linker: A chain- KKS PGRVVGGS V-B chain 
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FIGURE 40A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP- 144 template 
(cut with Eco RI) 

282-3' primer 



i r 



signal 

peptide A chain 



linker B chain 

— I 



primer 1729C PstI 



282-5' primer 



PCR mutagenesis 



i 



PCR mutagenesis 



^ gel purify 
^ EcoRI 



gel purify 




pAP282MTl-MMPNew Linker 



pVL 1393 



A chain M 



JJ chain 



-EcoRI 



PstI 



Signal 
eptide 
coRI 



L Pst I 



pVL 1393 + new linker 



EcoRI/ PstI 

dephosphorylate 
gel purify vector 



triple hgation 
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FIGURE 40B 

Sequence of MT-MMP Linker Region 



WT preprocin linker 

primer 282-3' 

5 ' - GCTCCTGGTATTCTTGGCGCTGATGTTTGT 
***★*★★★ * ★ *** 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT | CAC CATGGTTTAAAATTA 

•*• ******** ******* 

AGCAGTG T CAAAGGGGTTC CTGAGGATCC C -5' 
* primer 2 82-5' 



1) PCR mutagenesis 

2) Ligate with pVL1393 



pAP 282 linker 
(MT-MMP variant) 

CCCCAAGGACTCCTAGGG | GCTCCTGGTATTCTTGGC 
GGGGTTCCTGAGGATCCC | CGAGGACCATAAGAACCG 
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FTGURE 40C (PI) 

Sequence of pAP282 insert 



10 20 30 40 50 

I I I i I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTT AACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

3 51 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 AT C AGG AAGATG CAGAAGCAAT CACT CAT CTTTT CACTGATGTT CAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

6 01 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
T AAG GTT AT AT AACT C C CTCTTT ACGCGTG CTCTTAATC CATGTTGG CCT 
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FIGURE 40C (T>2) 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTCCCCAAGGACTCCTAGGGGCTCCTGGTATTCTTGGCGC 
AGCAGTGTCAAAGGGGTTCCTGAGGATCCCCGAGGACCATAAGAACCGCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGAGTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTAT GTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGT CCGGGAGT CTATGTGATGAT CTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

13 51 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 

TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 
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FIGURE 40C (P3) 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTT AAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

18 01 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP282 
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FIGURE 40D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of MT-MMP to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-282 (MT-MMP) linker: A chain- PQGLLGAP'GIL G-B chain 
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FIGURE 41A 

PCR Mutagenesis of Prcproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

284-3' primer 



i i: 



signal 

peptide A chain 



linker 



284-5' primer 



PCR mutagenesis 



^gel purify 
^EcoRI 

i 



gel purify 



B chain 



primer 1729c Pst I 



PCR mutagenesis 




pAP284 Stromelysin-3 = MMP1 1 New Linker 



A chain. 
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pVL 1393 + new linker 
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FIGURE 41 C (PI) 

Sequence of pAP2 84 insert 

10 20 30 40 50 

I I I I I 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

5 1 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAG AGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

3 01 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 

TTAG TACGT CTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

4 01 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 

TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

6 01 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATT CCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 
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FIGURE 41C (P2) 



701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTXTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTT 
AGCAGTGTCAAA 

Linker Sequence : 

CACGGCCCCGAGGGTTTAAGAGTGGGATTTTATGAATCTGACGTCATGGG 
GTGCCGGGGCTCCCAAATTCTCACCCTAAAATACTTAGACTGCAGTACCC 

AAGAGGCCATGCTCGTTTAGTTCATGTCGAAGAGCCTCACACT 
TTCTCCGGTACGAGCAAATCAAGTACAC3CAACTCGGAGTGTGA 

94 9 GC 

CG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGT TGTGGCC ATGCAAGTCTAATACAGAt GCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

12 01 ACTG ATGCC ACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 



t 
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FIGURE 41 r ( P*> 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAA GAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCG ATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 
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FIGURE 41D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of MMP-11 (Stromelysin-3) to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-284 (MMP-11) linker: 

A chain- HGPEGLRVGFY ESDVMGRGHARLVHVEEPHT-B chain 
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FIGURE 42A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP- 144 template 
(cut with Eco Rl) 

286-3* primer 



I 



signal 
peptide 



A chain 



linker 



B chain 



286-5* primer 



primer 1729CPstI 



PCR mutagenesis 



PCR mutagenesis 



gel purify 

EcoRI 
gel purify 
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PTCURE 42B 

Sequence of MMP-13 « CoIlagenase-3 Linker Region 
WT preprocin linker 

primer 286-3' 

5'- GGTCAACGAGGCATT GT CGCT GAT GTTT GT 
★★★★ ★ +** 

TCTTTGCTTATAAGGCCA | GT GGT AC CAAATTTTAAT 

AGAAACGAATATTCCGGT [ CACCATGGTTTAAAATTA 

AGCAGTGTCAAACCTGGAGTCCCCGAACGA -5' 
primer 286-5' 



1) PCR mutagenesis 

2) LigatewithpVL1393 



pAP 286 linker 
(MMP-13 variant) 

GGACCTCAGGGGCTTGCT | GGTCAACGAGGCATTGTC 
■CCTGGAGTCCCCGAACGA | CCAGTTGCTCCGTAACAG 
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F Tf2TTHF 42C (PI) 

Sequence of pAP286 insert 



10 20 30 40 50 

I 'I I I I 

1 GAAT T CAT GAAAC CGGGAGGAAAT ACTAT T GTAAT AT GGAT GT AT GCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGAT AACAACAT AT T CCC CAAAC AAT AC C CAAT T AT AAAC T T T AC CACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 AT CAGGAAGAT GCAGAAGCAAT CAC T CAT CT T T T CAC T GATGT TCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTT TAT AAT T T GCAT CCAAAT GAT T T CAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GAT C T GCACCAGAT C C TAGC GTAAT T ACAC T T GAGAAT AG T T GG GGGAGA 
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FIGURE 42C (P2) 

CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAAT TAGGGATAGTATCGAGAG TACCACATATC TACGCGT GGAGGT GGT 

901 TCGTCACAGTTTGGACCTCAGGGGCTTGCTGGTCAACGAGGCATTGTCGC 
AGCAGTGTCAAACCTGGAGTCCCCGAACGACCAGTTGCTCCGTAACAGCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

13 01 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 

AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
T TAT TATGTGT T GGAAAACAAT GT TGGTAACAAC C CGATAT ACCAGACAC 

14 01 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 

GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAAC C GAGAT AAT T GC C T T AC AAGT GAT T C T AAT AT AC GGGAAAC AG T 
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pTdTTrcF 42C (F3) 

GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTfAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 

GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

18 01 GGACAT T GTAAAT T T TG TAAC T GAAAGGACAGCAAGT TATAT CGAAT T CC 

CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

18 51 TGCAG 
ACGTC 
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FIGURE 42D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of MJV1P-13 (CoIIagenase-3) to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-286 (MMP-13) linker: A chain- GPQGLAGQRGIV-B chain 



205/254 

FIGURE 43A 

PGR Mutagenesis of Preproricrin Gene to Create A Variant Gene in 
Bacolovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

288-3' primer 



signal 
peptide 



A chain 



linker 



B chain 



primer 1729C PstI 



288-5' primer 
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PCR mutagenesis 
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PCR mutagenesis 



^gel purify 

^EcoRI 
^ gel purify 




pAP288 Tissue-type Plasminogen Activator New Linker 
A chain - - - - Bchain 



ignal 
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pVL 1393 + new linker 




EcoRI 



Psa 



dephosphorylate 
gel purify vector 
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FIGURE 43B 

Sequence of Tissue-type Plasminogen Activator (tPA) Linker Region 
WT preprocin linker 

primer 288-3' 

5'- GGTCGTAAAGCT CTTGAAGCTGAT GTTT GT -3 
******* ** * * 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT | CACCATGGTTTAAAATTA 

******** ********* 

AGCAGT GTCAAAC CGC CTAGACCCGTTT CC -5' 
primer 288-5' 

1) PCR mutagenesis 

2) LigatewithpVL1393 



pAP 288 linker 
(tPA variant) 

GGCGGATCTGGGCAAAGG | GGTCGTAAAGCTCTTGAA 
CCGCCTAGACCCGTTTCC i CCAGCATTT CGAGAACTT 
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FIGURE 43C (Fl) 

Sequence of pAP288 insert 



10 20 30 40 50 

I , I I I i 

1 GAAT TCAT GAAACCGGGAGGAAATAC TAT T GTAAT AT GGAT GT AT GCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCAAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

2 01 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
T GT CT C AACCAAACGGAT AT T T GG T T GCCAAATAAAAT CAAC T T GAGAG T 

301 AAT CAT GCAGAGC T T T C T G T T AC AT T AGC GC T GGAT G T CAC C AAT G CAT A 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
AC AC C AGC C GAT GGC AC G AC C T T TAT C GC G T AT AAAGAAAGT AGGAC T G T 

4 01 AT CAG GAAGAT GC AGAAG C AAT CAC T CAT C T T T T CAC T GAT G T T C AAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GC TAT AT G T AAGCGGAAAC CAC CAT T AAT AC TAT C T GAAC T T G T T G AAC G 

501 T GGTAAT C T GAGAGAAAAT ATCGAGT TGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
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FTCiURE 43C (P2) 

CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

8 01 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTGGCGGATCTGGGCAAAGGGGTCGTAAAGCTCTTGAAGC 
AGCAGTGTCAAACCGCCTAGACCCGTTTCCCCAGCATTTCGAGAACTTCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAAT ACACAACC T T T T GT TACAACCAT T GT T GGGCTAT AT GGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CT TGCAAGCAAATAGT GGACAAGT AT GGATAGAGGACT GTAGCAGT GAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

14 51 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAAC C GAGAT AAT T GC C T T ACAAGT GAT T C T AAT AT ACGGGAAACAGT 
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FIGURE 43C (P3) 

GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
AC AAT T C T AGGAGAGAAC AC C GGGAC GT AG GAGACC GG T T GC TAC C T AC A 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1 651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 T GGTGACCCAAACCAAATAT GGTTACCATTAT T T TGATAGACAGAT TACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

17 51 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP288 
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FIGURE 43D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Tissue-type Plasminogen Activator (tPA) to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-288 (tPA) linker: A chain- GGSGQRGRKAL E-B chain 
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PTHTIRE 44 A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 



a) Cloning Strategy 

primer Ricin- 109Eco 



I L 



signal . 
peptide A cnain 



PCR mutagenesis 



^gel purify 

^ EcoRI 
^ gel purify 



pAP- 144 template 
(cut with Eco RI) 

290-3' primer 



linker 



290-5* primer 



B chain 



primer 1 729C Pst I 



PCR mutagenesis 




pAP290 human Prostate-Specific Antigen (PSA) New Linker 



A chain. 



Signal 



eptide 
oRI 



pVL 1393 + new 




EcoRI 



PstI 



EcoRI/ Pst I 
dephosphorylate 
gel purify vector 
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FTCTTRE 44B 

Sequence of human Prostate-Specific Antigen (PSA) Linker Region 

WT preprocin linker 

primer 290-3' 

5'- TCTTCCGATATTTTTAATGCTGATGTTTGT -3 
★ ★ + + * 

TCTTTGCTTATAAGGCCA | GTGGTACCAAATTTTAAT 

AGAAAC GAAT AT T C C G GT | CACCATGGTTTAAAATTA 

AGCAGTGTCAAAAGAAACAGTCGAGAAGAG -5' 
primer 290-5' 



1) PCR mutagenesis 

2) Ligate with pVL1393 



pAP 290 linker 
(PSA variant) 

■TCTTTGTCAGCTCTTCTC | TCTTCCGATATTTTTAAT- 
AGAAACAGT C GAGAAGAG | AGAAGGCTATAAAAATTA 
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FIGURE 44C (PI) 



Sequence of pAP290 insert 



10 20 30 40 50 

I I I I > 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGAT AACAAC AT AT T C C C C AAAC AAT AC CCAAT T ATAAAC T T T AC CACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

7 01 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
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PTnTTHK 44C (P2) 



CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGTCAGCTCTTCTCTCTTCCGATATTTTTAATGC 
AGCAGTGTCAAAAGAAACAGTCGAGAAGAGAGAAGGCTATAAAAATTACG 

951 TGATGTJTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GT CAACACCGGTACGT TCAGAT TAT GT C T ACG T T TAGT CGAGAC C T GAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

14 51 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
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FIGURE 44C (P3) 



GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

17 01 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

VP 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACAT T GT AAAT T T T G T AAC T GAAAGGACAGCAAG T TAT AT C GAAT T C C 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP2 90 
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FTC:iJRE 44D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of human Prostate-Specific Antigen (PSA) to Wild Type 

Wild type ricin linker: * A chain- SLLIRPVVPNFN-B chain 
pAP-290 (PSA) linker: A chain- SLSALLSSDI FN-B chain 
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FTGURE 45A 

PGR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut with Eco RI) 

292-3' primer 



signal 
peptide 



A chain 



linker 

H 



B chain 



17^ 



primer 1729CPstI 



292-5' primer 



PCR mutagenesis 



PCR mutagenesis 



^ gel purify 

^ EcoRI 
^ gel purify 




A chain 



pAP292 Kallikrein (hK3) New Linker 

B chain 



signal 
eptide 
coRI 



pVL 1393 + new linker 




EcoRI 



PstI 



dephosphorylate 
gel purify vector 
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FIGURE 45B 

Sequence of Kallikrein (hK3) Linker Region 
WT preprocin linker 

primer 2$2-3 # 

5' ~ AWMPC6CTGGCTTTAATGCTGATGTTTGT 

* ** ******* 

TCTTTGCTTATAAGGCCA I GTGGTACCAAATTTTAAT * 

AGAAACGAAXATTCCGGT | CACCATGGTTTAAAATTA 

* * ***** 

-AGCAGTGTCAAAAGAAACGGArCTAAATTT -5* 
prlioer 292-5' 



1) PCR mutagenesis 



2) Ligate with pVL1393 



pAP 292 linker 

(Kallikrein variant) * 

- TCTTTGCCTAGATTTAAAJ ATTATCGGTGGC'm'AAT- 

AGAAACGGATCTAAATTT | TAATAGCCACCGAAATTA 
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FIGURE 45C (PI) 



Sequence of pAP292 insert 



10 20 30 40 50 

I I I I 1 

1 gaattcatgaaaccgggaggaaatactattgtaatatggatgtatgcagt 
cttaagtactttggccctCctttatgataacattatacctacatacgtca 

51 ggcaacatggctttgttttggatccacctcagggtggtctttcacattag 
ccgttgtaccgaaao^aaacctaggtggagtcccaccagaaagtgtaatc 

101 aggataacaacatattccccaaacaatacccaattataaactttaccaca 
tcctattgttgtataaggggtttgttatgggttaatatttgaaatggtgt 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

251 ACAGAGTTGGTTTGCCTATAAACCAACGGTTTATTTTAGTTGAACTCTCA 
TGTCTCAACCAAACGGATATTTGGTTGCCAAATAAAATCAACTTGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCCAAATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
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FTCTTRE 45C (P2) 

CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CTTTCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GftAAGGTGACGTTAAGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGT TTCTGCAT TACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCCTAGATTTARRATTATCGGTGGCTTTAATGC 
AGCAGTGTCAAAAGAAACGGATCTAAATTTTAATAGCCACCGAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTAXACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGT TTATCACCTGTTCATACCTATCTCCTGACATCGTCACT TT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 



221/254 



PTnTTHF 45C (P3) 

GTTTTGGCTCTATTAACGGaATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGtTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCRACGATGGATGT 
ACARTTCTAGGRGAG7iRCACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1 60 1 TCAAGAATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1 65 1- GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCAAACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAAGATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855. 
Sequence name: PAP292 
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FTGURE 45P 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Kallikrein (hK3) to Wild Type 

Wild type ridn linker: A chain- SLLIRFWPKFN-B chain 

pAP-292 (hK3) linker: A chain- 5 i r « f K I I S G FN -B chain 
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FIGURE 46A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP-144 template 
(cut withEcoRI) 

294-3' primer 



i r 



signal 
peptide 



A chain 



linker 



294-5' primer 



i 



PCR mutagenesis 



^ gel purify 
^ EcoRI 



gel purify 



B chain 



primer 1729CPstI 



PCR mutagenesis 




A chain 



pAP294 Neutrophil Elastase New Linker 

chain 



EcoRI 




EcoRI/ Pst I 

dephosphorylate 
gel purify vector 
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FTGURE 46B 



Sequence of Neutrophil Elastase Linker Region 
WT prrprocin tinker 

primer 294-3* 

5'- GTTCCTGGTAATTTTAATGCTGATGTTTGT 

-TCTTTGCTTATAAGGCCAI CTGCTACCAAKTTWAAT 

-AGAAACGAATATTCCGGT I CACCATGGTTTAAAATTA 

*** +++ * 

AGCAGTGTCAAAAGAAACGAACCGTAACGA -5' 
primer 294-5' 



1) PCR mutagenesis 

2) Ligate with pVL1393 



pAP 294 linker , 
(Neutrophil elastase variant) 

TCTTTGCTTGGCATTGCT 1 GTTCCTGGTAATTTTAAT' 

AGAAACGAACCGTAACGA I CAAGGACCATTMAATTA 
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FTCURE 46C (PI) 



Sequence of pAP294 insert 



10 20 30 40 50 

I I I I t 

1 GAATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 
CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCRAACAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGAAATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAAACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACAACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCjRCAACGGTT 

251 acagagttggtttgcctataaaccaacggtttattttagttgaactctca 
tgtctcaaccaaacggatatttggttgccaaataaaatcaacttgagagt 

301 aatcatgcagagctttctgttacattagcgctggatgtcaccaatgcata 
ttagtacgtctcgaaagacaatgtaatcgcgacctacagtggttacgtat 

351 tgtggtcggctaccgtgctggaaatagcgcatatttctttcatcctgaca 
acaccagccgatggcacgacctttatcgcgtataaagaaagtaggactgt 

401 atcaggaagatgcagaagcaatcactcatcttttcactgatgttcaaaat 
tagtccttctacgtcttcgttagtgagtagaaaagtgactacaagtttta 

451 cgatatacattcgcctttggtggtaattatgatagacttgaacaacttgc 
gctatatgtaagcggaaaccaccattaatactatctgaacttgttgaacg 

501 tggtaatctgagagaaaatatcgagttgggaaatggtccactagaggagg 
accattagactctcttttatagctcaaccctttaccaggtgatctcctcc 

551 ctatctcagcgctttattattacagtactggtggcactcagcttccaact 
gatagagtcgcgaaataataatgtcatgaccaccgtgagtcgaaggttga 

601 ctggctcgttcctttataatttgcatccaaatgatttcagaagcagcaag 
gaccgagcaaggaaatattaaacgtaggtttactaaagtcttcgtcgttc 

651 attccaatatattgagggagaaatgcgcacgagaattaggtacaaccgga 
taaggttatataactccctctttacgcgtgctcttaatccatgttggcct 

701 gatctgcaccagatcctagcgtaattacacttgagaatagttgggggaga 



226/254 



FTCURE 46C (P2) 



CTAGACGTGGTCTAGGATCGCATTAATGTGAACTCTTATCAACCCCCTCT 

751 CT't'TCCACTGCAATTCAAGAGTCTAACCAAGGAGCCTTTGCTAGTCCAAT 
GAAAGGTGACGTTAftGTTCTCAGATTGGTTCCTCGGAAACGATCAGGTTA 

801 TCAACTGCAAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATC3VTAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTCTTTGCTTGGCATTGCTGTTCCTGGTAATTTTAATGC 
AGCAGTGTCAAAAGAAACGAACCGTAACGACAAGGACCATTAAAATTACG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGAAACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGT TGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACT TT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGACAATACTATTCGATCTAATGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCAAATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGATCTAGTCTAGTTTTAGCAGCGACATCAGGGAACAGTGGTACCACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTT TTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCAAATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
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FIGURE 46C (P3) 



GTTTTGGCTCTATT2ACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGiTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCARCGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGAATGATGGAkCCATTTT^ 

AGTTCTTACTACCTTGGTAAAATTTA2VACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACATUVTO^TTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGTTTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCA7VACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAAAACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGA2VAATAGATGGCTTAAATAAAAA 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAAAGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAAAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855* 
Sequence name: PAP294 
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VTCIJRE 46D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Neutrophil elastase to Wild Type 

Wild type ricin linker: A chain- S L L I RPVVFNFN -B chain 

pAP-294 (Neutrophil elastase) linker; 

A chain- SLLGIAVPGNFH-B chain 
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FIGURE 47A 

PCR Mutagenesis of Preproricin Gene to Create A Variant Gene in 
Baculovirus Transfer Vector, pVL 1393 

a) Cloning Strategy 



primer Ricin-109Eco 



pAP- 144 template 
(cut with Eco RI) 

296-3' primer 



» r 



signal 

peptide A chain 



linker 



296-5' primer 



PCR mutagenesis 



^ gel purify 
^ EcoRI 



gel purify 



B chain 



primer 1729CPstI 



PCR mutagenesis 




pAP296 Calpain New Linker 



A chain M 



Signal 
eptide 
coRI 



pVL 1393 + new linker 




EcoRI 



PstI 



dephosphorylate 
gel purify vector 
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FTGURE 47B 



Sequence of Calpain Linker Region 

WT preprocin linker 

primer 296-3* 

5'- ACTCCTAGAACCCCCCCAGCTCATGTTTOT 

******** ******** 

TCTTTGCTTATAAGGCCA I CTGGTACCAAATTTTAAT 

AGAAACGAATATTCCGGT I CACCATGGTTTAAAATTA 

★ *** *♦ ***** 

-AGCAGTGTCAAAAAAAAGTTTTTATAACAA -5' 
primer 296-5* 



1) PCR mutagenesis 

2) Ligate with pVL1393 



pAP 296 linker 

(Calpain variant) 

TTTTTCAAAAAXAXTGTT | ACTCCTAGAACCCCCCCA 

AAAAAGTTTTTATAACAA I TGAGGATCTTGGGGGGGT 
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FIGURE 47C (PI) 

Sequence of pAP296 insert 



10 20 30 40 50 

i I I I I 

1 G7VATTCATGAAACCGGGAGGAAATACTATTGTAATATGGATGTATGCAGT 

CTTAAGTACTTTGGCCCTCCTTTATGATAACATTATACCTACATACGTCA 

51 GGCAACATGGCTTTGTTTTGGATCCACCTCAGGGTGGTCTTTCACATTAG 
CCGTTGTACCGAAACAAAACCTAGGTGGAGTCCCACCAGAAAGTGTAATC 

101 AGGATAACAACATATTCCCCATyVCAATACCCAATTATAAACTTTACCACA 
TCCTATTGTTGTATAAGGGGTTTGTTATGGGTTAATATTTGA7\ATGGTGT 

151 GCGGGTGCCACTGTGCAAAGCTACACAT^ACTTTATCAGAGCTGTTCGCGG 
CGCCCACGGTGACACGTTTCGATGTGTTTGAAATAGTCTCGACAAGCGCC 

201 TCGTTTAACZ\ACTGGAGCTGATGTGAGACATGATATACCAGTGTTGCCAA 
AGCAAATTGTTGACCTCGACTACACTCTGTACTATATGGTCACAACGGTT 

2 51 ACAGAGTTGGTTTGCCTATAAAC 

TGTCT CAACCAAACGGAT AT T TGGTTGCCAAATAAAAT CAACT TGAGAGT 

301 AATCATGCAGAGCTTTCTGTTACATTAGCGCTGGATGTCACCAATGCATA 
TTAGTACGTCTCGAAAGACAATGTAATCGCGACCTACAGTGGTTACGTAT 

351 TGTGGTCGGCTACCGTGCTGGAAATAGCGCATATTTCTTTCATCCTGACA 
ACACCAGCCGATGGCACGACCTTTATCGCGTATAAAGAAAGTAGGACTGT 

401 ATCAGGAAGATGCAGAAGCAATCACTCATCTTTTCACTGATGTTCAAAAT 
TAGTCCTTCTACGTCTTCGTTAGTGAGTAGAAAAGTGACTACAAGTTTTA 

451 CGATATACATTCGCCTTTGGTGGTAATTATGATAGACTTGAACAACTTGC 
GCTATATGTAAGCGGAAACCACCATTAATACTATCTGAACTTGTTGAACG 

501 TGGTAATCTGAGAGAAAATATCGAGTTGGGAAATGGTCCACTAGAGGAGG 
ACCATTAGACTCTCTTTTATAGCTCAACCCTTTACCAGGTGATCTCCTCC 

551 CTATCTCAGCGCTTTATTATTACAGTACTGGTGGCACTCAGCTTCCAACT 
GATAGAGTCGCGAAATAATAATGTCATGACCACCGTGAGTCGAAGGTTGA 

601 CTGGCTCGTTCCTTTATAATTTGCATCO^AATGATTTCAGAAGCAGCAAG 
GACCGAGCAAGGAAATATTAAACGTAGGTTTACTAAAGTCTTCGTCGTTC 

651 ATTCCAATATATTGAGGGAGAAATGCGCACGAGAATTAGGTACAACCGGA 
TAAGGTTATATAACTCCCTCTTTACGCGTGCTCTTAATCCATGTTGGCCT 

701 GATCTGCACCAGATCCTAGCGTAATTACACTTGAGAATAGTTGGGGGAGA 
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FTOURE 47C (P2) 

ctagacgtggtctaggatcgcattaatgtgaactcttatcaaccccctct 

751 ct'ttccactgcaattcaagagtctaaccaaggagcctttgctagtccaat 
garaggtgacgttaagttctcagattggttcctcggaaacgatcaggtta 

801 TCAACTGCaVAAGACGTAATGGTTCCAAATTCAGTGTGTACGATGTGAGTA 
AGTTGACGTTTCTGCATTACCAAGGTTTAAGTCACACATGCTACACTCAT 

851 TATTAATCCCTATCATAGCTCTCATGGTGTATAGATGCGCACCTCCACCA 
ATAATTAGGGATAGTATCGAGAGTACCACATATCTACGCGTGGAGGTGGT 

901 TCGTCACAGTTTTTTTTCAAAAATATTGTTACTCCTAGAACCCCCCCAGC 
AGCAGTGTCAAAAAARAGTTTTTATAACAATGAGGATCTTGGGGGGGTCG 

951 TGATGTTTGTATGGATCCTGAGCCCATAGTGCGTATCGTAGGTCGAAATG 
ACTACAAACATACCTAGGACTCGGGTATCACGCATAGCATCCAGCTTTAC 

1001 GTCTATGTGTTGATGTTAGGGATGGAAGATTCCACAACGGARACGCAATA 
CAGATACACAACTACAATCCCTACCTTCTAAGGTGTTGCCTTTGCGTTAT 

1051 CAGTTGTGGCCATGCAAGTCTAATACAGATGCAAATCAGCTCTGGACTTT 
GTCAACACCGGTACGTTCAGATTATGTCTACGTTTAGTCGAGACCTGAAA 

1101 GAAAAGAGAC^TACTATTCGATCTABlTGGAAAGTGTTTAACTACTTACG 
CTTTTCTCTGTTATGATAAGCTAGATTACCTTTCACAAATTGATGAATGC 

1151 GGTACAGTCCGGGAGTCTATGTGATGATCTATGATTGCAATACTGCTGCA 
CCATGTCAGGCCCTCAGATACACTACTAGATACTAACGTTATGACGACGT 

1201 ACTGATGCCACCCGCTGGCARATATGGGATAATGGAACCATCATAAATCC 
TGACTACGGTGGGCGACCGTTTATACCCTATTACCTTGGTAGTATTTAGG 

1251 CAGAT C TAG T C TAG T T T T AGCAGC GACAT CAGG GAA.CAGT GGT AC CACAC 
GTCTAGATCAGATCAAAATCGTCGCTGTAGTCCCTTGTCACCATGGTGTG 

1301 TTACAGTGCAAACCAACATTTATGCCGTTAGTCAAGGTTGGCTTCCTACT 
AATGTCACGTTTGGTTGTAAATACGGCAATCAGTTCCAACCGAAGGATGA 

1351 AATAATACACAACCTTTTGTTACAACCATTGTTGGGCTATATGGTCTGTG 
TTATTATGTGTTGGAAAACAATGTTGGTAACAACCCGATATACCAGACAC 

1401 CTTGCAAGCM\ATAGTGGACAAGTATGGATAGAGGACTGTAGCAGTGAAA 
GAACGTTCGTTTATCACCTGTTCATACCTATCTCCTGACATCGTCACTTT 

1451 AGGCTGAACAACAGTGGGCTCTTTATGCAGATGGTTCAATACGTCCTCAG 
TCCGACTTGTTGTCACCCGAGAAATACGTCTACCAAGTTATGCAGGAGTC 

1501 CAAAACCGAGATAATTGCCTTACAAGTGATTCTAATATACGGGAAACAGT 
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FIGURE 47C (P3) 

GTTTTGGCTCTATTAACGGAATGTTCACTAAGATTATATGCCCTTTGTCA 

1551 TGTfTAAGATCCTCTCTTGTGGCCCTGCATCCTCTGGCCAACGATGGATGT 
ACAATTCTAGGAGAGAACACCGGGACGTAGGAGACCGGTTGCTACCTACA 

1601 TCAAGZVATGATGGAACCATTTTAAATTTGTATAGTGGATTGGTGTTAGAT 
AGTTCTTACTACCTTGGTAAAATTTAAACATATCACCTAACCACAATCTA 

1651 GTGAGGCGATCGGATCCGAGCCTTAAACAAATCATTCTTTACCCTCTCCA 
CACTCCGCTAGCCTAGGCTCGGAATTTGT TTAGTAAGAAATGGGAGAGGT 

1701 TGGTGACCCARACCAAATATGGTTACCATTATTTTGATAGACAGATTACT 
ACCACTGGGTTTGGTTTATACCAATGGTAATAT^AACTATCTGTCTAATGA 

1751 CTCTTGCAGTGTGTGTGTCCTGCCATGAAAATAGATGGCTTAAATAAAAA. 
GAGAACGTCACACACACAGGACGGTACTTTTATCTACCGAATTTATTTTT 

1801 GGACATTGTAAATTTTGTAACTGAZ^AGGACAGCAAGTTATATCGAATTCC 
CCTGTAACATTTAftAACATTGACTTTCCTGTCGTTCAATATAGCTTAAGG 

1851 TGCAG 
ACGTC 



Total number of bases is: 1855- 
Sequence name; PAP296 
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PTHTTRE 47D 

Amino acid sequence Comparison of Mutant Preproricin Linker 
region of Calpain to Wild Type 

Wild type ricin linker: A chain- SLLIRPVVPNFN-B chain 

pAP-29E (Calpaiji) linker: A chain- FFKHIVT PRTP P-B chain 
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FIGURE 48 

Cleavage of pAP 214 by Cathepsin B 




A. Ricin standard 

B. pAP214 

C. pAP 21 4 digested with 1 00 ng of Cathepsin B (18 hours) 

D. pAP 21 4 digested with 61 8 ng of Cathepsin B (1 8 hours) 
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FTGURE 49 

Cleavage of pAP 220 with MMP-9 



A B C D E 

-60 kDa 
-30 kDa 

A. pAP220 

B. pAP 220 digested with 200 ng of MMP-9 (1 6 hrs) 

C. pAP 220 digested with 20 ng of MMP-9 (1 6hrs) 

D. pAP 220 digested with 20 ng of MMP-9 (2hrs) 
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T?TCIURE 50 

Activation of pAP 214 



A B C D E F G H I J K L M 




A. 41 .7 pg of pAP 214 digested with Cathepsin B 

B. 291 pg of pAP 214 digested with Cathpepsin B 

C. 2.0 ng of pAP 214 digested with Cathepsin B 

D. 14.2 ng of pAP 214 digested with Cathepsin B 

E. 100 ng of pAP 214 digested with Cathepsin B 

F. Negative control 

G. RicinAchain 

H- 41 .7 pg of pAP 21 4 variant 
I- 291 pg of pAP 21 4 variant 
J - 2.0 ng of pAP 21 4 variant 
K. 14.2 ng of pAP 214 variant 
L. 1 00ng of pAP 21 4 variant 
M. RNA ladder 
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FTHURE 51 

Activation of pAP 220 



ABCDEFGHIJKLM 



390 bp 




A. 48.5 pg of pAP 220 variant 

B. 291 pg of pAP 220 variant 

C. 2.0 ng of pAP 220 variant 

D. 14.3 ng of pAP 220 variant 

E. 1 00 ng of pAP 220 variant 

F. RicinAchain 

G. Negative Control 

H. 48.5 pg of pAP 220 variant digested with MMP-9 

I. 291 pg of pAP 220 variant digested with MMP-9 
J. 2.0 ng of pAP 220 variant digested with MMP-9 
K. 14.3 ng of pAP 220 variant digested with MMP-9 
L. 100 ng of pAP 220 variant digested with MMP-9 
M. RNA ladder 
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FIGURE 52 

Cleavage of pAP-248 Protein by The Human Cytomegalovirus (HCMV) protease 



A B C D 



60 kDa 



30 kDa 





A. pAP-248 (0.279 ug) 

B. pAP-248 protein (0.279 ng) digested with 0.25 jig of the HCMV protease 

C. Ricin standard (20 ng) 

D. Ricin standard (40 ng) 
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FTGURE 53 

Activation of pAP-248 Protein 



ABCDEFGHI JK.L M 




A. 90 ng of pAP-248 variant 

B. 12.8 ng of pAP-248 variant 

C. 1.8 ng of pAP-248 variant 

D. 260 pg pAP-248 variant 

E. 37pgof pAP-248 variant 

F. Negative control 

G. RicinAchain 

H. 37 pg of pAP-248 digested with HCMV protease 

I. 260 pg of pAP-248 digested with HCMV protease 
J. 1.8 ng of pAP-248 digested with HCMV protease 
K. 12.8 ng of pAP-248 digested with HCMV protease 
L. 90 ng of pAP-248 digested with HCMV protease 
M. RNA ladder 
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FIGURE 54 

Cleavage of pAP-256 protein by The Hepatite A Virus 3C (HAV 3C) Protease 

ABC 

60kDa 
30kDa 




A. 
B. 
C. 



Ricin standard (0.250 ug) 
pAP-256 protein (0.378 ug) 

pAP-256 protein digested (0.302 ug) with 1.25 \ig of the HAV 3C protease 
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FIGURE 55 



Activation of pAP-256 Protein 



ABCDEFGHI JKLM 




_ 390 b 
fragment 



A. 100 ng of pAP-256 variant 

B. 14.2 ng of pAP-256 variant 

C. 2.0 ng of pAP-256 variant 

D. 291 pg of pAP-256 variant 

E. 41.7 pg of pAP-256 variant 

F. Negative control 

G. RicinAchain 

H. 41.7 pg of pAP-256 digested with HAV 3C protease 
L 291 pg of pAP-256 digested with HAV 3C protease 
J. 2.0 ng of pAP-256 digested with HAV 3C protease 
K. 14.2 ng of pAP-256 digested with HAV 3C protease 
L. 100 ng of pAP-256 digested with HAV 3C protease 
M. RNA ladder 
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FIGURE fifi 

Cytotoxicity of Digested and Undigested 
pAP 214 with Cathepsin B to COS-1 Cells 

° 214+ CathepsinB 

• 214 neg control 

* ricin control 



l l — I I — I i Hiiiij — I i J 1 1 i 1 1 1 — i i — i 1 1 1 inij 




[Toxin] 





Ricin 


pAP214 


pAP 214 + Cathepsin B 


IC 50 (ng/ml) 


0.11 


1.9 


0.078 


Relative Toxicity 


IX 


17X 


0.7X 
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FIGURE 57 



Cytotoxicity of pAP220 Digested with MMP-9 Compared 
to Freshly Thawed pAP220 and Ricin on COS-1 Cells 



o pAP-220+MMP9 
• pAP-220 
■ Ricin 



CO 

> 



CO 
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80 



60 



40 



20 



I I llllllj I 



I I lllltl| — TTT 




0.001 0.01 



0.1 1 
[Toxin] ng/ml 



10 



100 





Ricin 


pAP220 


pAP 220 + MMP-9 


IC 50 (ng/ml) 


0.31 


6.7 


0.13 


Relative Toxicity 


IX 


22X 


0.4X 
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FIGURE 58 

Cleavage of pAP-270 protein by The Matrix Metalloproteinase 2 (MMP-2) 




A. pAP-270 (0.120 \tg) undigested 

B. pAP-270 (0.120 jig) digested with 0.250 ng MMP-2 

C. Ricin Standard (0.05 ng) 



246 /254 
FTGURE 59 



Activation of pAP-270 protein 



ABCDE F GHIJKLM 




A. 100 ng of digested pAP-270 

B. 14.2 ng of digested pAP-270 

C. 2.0 ng of digested pAP-270 

D. 290 pg of digested pAP-270 

E. 46 ng of digested pAP-270 

F. RicinAchain 

G. Negative control 

H. 46 pg of pAP-270 

I. 290pgof pAP-270 
J. 2.0ngofpAP-270 
K. 14.2 ng of pAP-270 
L. 100ngofpAP-270 
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FTGURE 60 

Cleavage of pAP-288 protein by Plasminogen Tissue Activator (t-PA) 



ABC 




« 60 kDa 

< 30 kDa 



A. Ricin Standard (O.OS^g) 

B. pAP-288 (0.66 ng) undigested 

C. pAP-288 (0.60 Hg) digested with 0.18 ug of t-PA protease 
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FIGURE 61 



Activation of pAP-288 protein 



A BCDEFGHI J KLM 




A. 200ngofpAP-288 

B. 28.4ngofpAP-288 

C. 4.0 ngofpAP-288 

D. 482pgofpAP-288 

E. 83.4 pg of pAP-288 

F. Ricin A chain 

G. Negative control 

H. 83.4 pg of pAP-288 digested with tissue Plasminogen 
Activator (t-PA) 

I. 482 pg of pAP-288 digested with t-PA 
J. 4.0 ng of pAP-288 digested with t-PA 
K. 28.4 ng of pAP-288 digested with t-PA 
L. 200 ng of pAP-288 digested with t-PA 
M. RNA ladder 
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FIGURE 62 
Cleavage of pAP 294 With Human Neutrophil Elastase 




+ 60kDa 



30 kDa 



A. Ricin Standard ( 0.050 jig) 

B. pAP 294 protein ( 0.171 jig) digested with 1.42 jig of Human Neutrophil 
Elastase 

C. pAP 294 protein ( 0.121 \ig) 
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FIGURE 63 

Activation of pAP 294 Protein 



ABC DEFGHI JKLM 




A. 60ngofpAP294 

B. 8..57ngofpAP294 

C. 1.22 ng of pAP294 

D. 175pgofpAP294 

E. 25pgofpAP294 

F. RicinAchain 

G. Negative Control 

H. 360 ng of pAP 294 digested with Human Neutrophil Elastase 

I. 51 ng of pAP 294 digested with Human Neutrophil Elastase 
J. 7.3 ng of pAP 294 digested with Human Neutrophil Elastase 
K. 1.0 ng of pAP 294 digested with Human Neutrophil Elastase 
L. 150 pg of pAP 294 digested with Human Neutrophil Elastase 
M. RNA ladder 
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FIGURE 64 

Cleavage of pAP 296 with Calpain 
ABC 

■ 60kDa 
30kDa 

A. Ricin Standard (0.05 pg) 

B. pAP 296 (0.761 |ig) undigested 

C. pAP 296 (0.761 \ig ) digested with 4.0 ug of Calpain 
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FIGURE 65 
Activation of pAP 296 Protein 



ABCDE FGHIJKLM 




A. lOOngofpAP 296 variant 

B. 14.2 ng of pAP 296 variant 

C. 2.0ngofpAP 296 variant 

D. 290pgofpAP 296 variant 

E. 46 pg of pAP 296 variant 

F. RicinAchain 

G. Negative control 

H. 46 pg of pAP 296 variant digested with Calpain 
L 290pgof pAP296 variant digested with Calpain 
J. 2.0 ng of pAP 296 variant digested with Calpain 
K. 14.2 ng of pAP 296 variant digested with Calpain 
L. 100 ng of pAP 296 variant digested with Calpain 
M. RNA ladder 
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FIGURE 66 



Cleavage of pAP-222 Protein by The Matrix Metalloproteinase 2 (MMP-2) 



ABC 




60 kDa 
30 kDa 



A. Ricin Standard (0.250 ug) 

B. pAP-222 Protein (0.250 ug) 

C. pAP-222 protein ( 0.250 ug) digested with 0.28 ug of MMP-2 
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FIGURE 67 



Activation of pAP-222 Protein 



ABC DEF GHI JKLM 




A. 100 ng of pAP-222 variant 

B. 14.2 ng of pAP-222 variant 

C. 2.0 ng of pAP-222 variant 

D. 291 pg of pAP-222 variant 

E. 41.7pgof pAP-222 variant 

F. Ricin A chain 

G. Ricin A chain 

H. 41.7 pg of pAP-222 digested with MMP-2 

I. 291 pg of pAP-222 digested with MMP-2 
J. 2.0 ng of pAP-222 digested with MMP-2 
K. 14.2 ng of pAP-222 digested with MMP-2 
L. 100 ng of pAP-222 digested with MMP-2 
M. RNA ladder 
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